Vol. 68, No. 1 


Psychological Review 


Richard L. Solomon, Editor
University of Pennsylvania





CONTENTS 


Factorial Angles to Psychology
        J. P. Guilford


Personality Theory and the Problem of Stability and Change in Individual
Behavior: An Interpersonal Approach
        Paul F. Secord and Carl W. Backman


Realism of Confidence Judgements
        Joe K. Adams and Pauline Austin Adams


Application of Stimulus Sampling Theory to Situations Involving Social
Pressure
        Patrick Suppes and Franklin Krasne


An Analysis of GSR Conditioning
        M. A. Stewart, J. A. Stern, G. Winokur, and S. Fredman


Effects of Score Transformations in Q and R Factor Analysis Techniques
        Donald M. Broverman





PUBLISHED BIMONTHLY BY THE 


AMERICAN PSYCHOLOGICAL ASSOCIATION, INC. 





Editorial Consultants


Robert P. Abelson
E. James Archer
Solomon E. Asch
Alfred L. Baldwin
Raymond B. Cattell
William K. Estes
Leon Festinger
Eugene Galanter
W. R. Garner
David A. Grant
Norman Guttman
Herbert C. Kelman
John I. Lacey
Gardner Lindzey
R. Duncan Luce
Richard D. Mann
Frederick Mosteller
J. Trevor Peirce
Frank Restle
Benbow F. Ritchie
Edgar H. Schein
Kenneth W. Spence
Roger W. Sperry
Saul Sternberg
John A. Swets
John W. Thibaut
Ledyard R Tucker
Benton J. Underwood


(A large group of psychologists has kindly agreed to assist in the editing of 
manuscripts submitted to the Psychological Review. In each issue the Editor 
will list in the Review those editorial consultants who have helped him process 
manuscripts for that issue, so that their efforts may be gratefully acknowledged.) 





The Psychological Review is devoted to articles of theoretical significance to 
any area of scientific endeavor in psychology. Except for occasional articles 
solicited by the Editor, manuscripts exceeding 7,500 words will not be accepted. 
Ordinarily manuscripts which consist primarily of original reports of research 
should be submitted to other journals. However, reports of original research may 
be included in a major theoretical article which attempts to integrate several related 
original studies. 


Because of the large number of manuscripts submitted, there is an inevitable 
publication lag of several months. Authors may avoid this delay if they are 
prepared to pay the costs of publishing their own articles; the appearance of arti- 
cles by other contributors is not thereby delayed. 


Tables, footnotes, and references should appear on separate pages; all of these, 
as well as the text, should be typed double-spaced throughout, in all manuscripts 
submitted. 


All manuscripts must be submitted in duplicate. 


Original figures are prepared for publication; duplicate figures may be photo- 
graphic or pencil-drawn copies. Authors should retain a copy of the manuscript, 
in case there is loss in the mail. Manuscripts should be addressed to the Editor, 
Richard L. Solomon, Psychology Department, College Hall, University of Penn- 
sylvania, Philadelphia 4, Pennsylvania. 


Arthur C. Hoffman                    Helen Orr
Managing Editor                      Promotion Manager


PUBLISHED BIMONTHLY BY THE
AMERICAN PSYCHOLOGICAL ASSOCIATION, INC.
Prince and Lemon Streets, Lancaster, Pa. 
1333 Sixteenth St. N.W., Washington 6, D. C. 


$10.00 volume $2.00 issue 


Second-class postage paid at Lancaster, Pa. 


Send all communications, including address changes, to 1333 Sixteenth St. N.W., Washington 6, D. C. 
Address changes must arrive by the 10th of the month to take effect the following month. Undelivered 
copies resulting from address changes will not be replaced; subscribers should notify the post office that
they will guarantee second-class forwarding postage. Other claims for undelivered copies must be made
within four months of publication. 


© 1961 by the American Psychological Association, Inc. 


Vol. 68, No. 1





JANUARY 1961 


PSYCHOLOGICAL REVIEW 


FACTORIAL ANGLES TO PSYCHOLOGY¹


J. P. GUILFORD

University of Southern California


Very rarely has it been recognized that factor analysts have anything to
contribute toward the resolution of general, systematic issues in
psychology. It must be admitted at the beginning that there is no unified
body of psychological theory, as such, that has been developed by factor
analysts. What I shall say on the subject is the opinion of one person
and cannot be represented as an expression for any group.

The average factor analyst has delayed theory building because systematic
thinking requires information concerning a rather large number of
psychological factors before the most significant interrelationships
become apparent and systems can take shape.² The writer had felt no
particular call to consider the relation of psychological factors to
behavior theory until he recently faced the need to present in book form
an organized picture of personality from the point of view of factors
(Guilford, 1959a). As for the systematists in psychology who are not
factor analysts, probably none has regarded the method of factor analysis
as a scientific tool by which psychological knowledge can be advanced very
much; certainly not as a tool by which a comprehensive model of the
behaving organism could be derived.


¹ Based upon a paper presented at the University of Florida, Gainesville,
March 9, 1959, as a part of a symposium on points of view in psychology.

² An early attempt to develop anything like a system of psychology based
upon factor analysis was that of Spearman (1923, 1927). More recently
Cattell (1957) has applied factorial findings to behavior theory,
culminating in a rather elaborate presentation. Neither has dealt with
many of the major issues that have divided schools of thought.

Objectives of This Paper 

It is the purpose of this paper to 
show how factor theory and factor-analytic
methods and results can provide
the models and the information
upon which a comprehensive theory of 
behavior could be based. To this end, 
it will be necessary to present the 
logical foundation for factor analysis 
as a scientific method and to see how 
we can go logically from factors to 
the kinds of concepts that are more 
familiar to psychological theory. It is
much too early to say that a system 
of psychology has been worked out 
from this point of view. All I can do 
is to indicate some of the implications 
of the knowledge of factors and their 
interrelationships for the understand- 
ing of the behavior of individuals and 
some of the more general ideas that 
follow from these implications. The 
conclusions have some bearing upon 
many of the perennial, systematic
issues, such as that of associationism.
Thus, some of the lines of a factorial 
system can be drawn, and certain com- 
parisons can be made with the major 
schools of psychology. 


Why Factors Have Not Contributed 
to Psychological Theory 

One of the reasons the findings of 
factor analysis have not contributed 
more to psychological theory is that 
many analyses have not been basic 
studies of the properties of human 
nature, either by intention or by way 
of outcome. One practical use of 
factor analysis is the reduction of cor- 
relational data, in order to transform 
them into simpler form for inspection. 
The chances of revealing anything of 
fundamental psychological interest
under these circumstances are rather 
remote. 

Even when the objective is to dis- 
cover something of general interest 
regarding human nature, the outcomes 
are often disappointing. Many fac- 
torial studies of this type are poorly 
planned because they are forced to be 
simply exploratory. In this instance, 
there is not enough prior information 
as a basis for setting up fruitful hy- 
potheses. Even a single, prior ex- 
ploratory analysis in the same area of 
behavior would be of considerable 
help. Without good hypotheses there 
is limitation as to the wise choice of 
experimental variables. The choice of 
variables to be analyzed together is all 
important for success. We cannot go 
into the technical reasons for this 
statement here, but the well-worn 
cliché “You get out of a factor analysis 
what you put into it” does apply, if 
it is revised to say: “You cannot get 
out of an analysis what you do not 
put into it.” 

Another reason for some failures of 
factor-analytical studies is the lack of 
good experimental design. More will
be said about this point later, but I
should like to add here that the fact 
that we have an elaborate computa- 
tional procedure for treating data does 
not relieve us of the necessity of ob- 
serving the ordinary requirements of 
experimental logic and experimental 
controls. 

But results from the better factor 
analyses are also commonly rejected 
as contributions to general psycho- 
logical knowledge. The reason is 
probably that the method has been 
most often used for the study of in- 
dividual differences. The fact that 
these studies have often been inspired 
more by the interests of vocational 
psychologists than by interest in basic 
facts of behavior is relatively coin- 
cidental. The basic data with which 
the factor analyst starts are measurements
of numerous individuals in a
large number of experimental vari- 
ables. In other words, the emphasis 
is upon individual differences. Fac- 
tors, when interpreted psychologically, 
are also conceived as ways in which 
individuals differ from one another. 
Psychological theorists, on the other 
hand, are primarily interested in how 
individuals are alike and only in- 
cidentally in how they differ. It is 
actually very easy to bridge the gap, 
but there has been little effort to do so. 

In this connection, the factorist 
could well charge the general theorist 
with being negligent for not bringing 
individual differences into the picture 
when he constructs a system of psy- 
chology. No well-known system, with 
the exception of Tolman’s purposive 
behaviorism, has anything of signifi- 
cance to say about individual differ- 
ences. Individual differences are rec- 
ognized as empirical facts, but they 
have been generally treated as nui- 
sances by the experimental psychol- 
ogist, and as phenomena of little or 
no interest by the theorist. The fact
that reasonably satisfying systems
could be built ignoring individual differences
is an important reason for not
bringing them into the picture.

One consequence of this general
state of affairs is that the theorist who
is concerned with personality often
finds such systems inadequate for his
purposes. Consequently he builds his
own theory. But even such a theory
often recognizes only implicitly the
need for a significant emphasis upon
individual differences. It rarely does
justice to this aspect of the meaning
of personality. Such a theorist recognizes
that each person is unique but
he does not sufficiently appreciate the
fact that it is individual differences
that make him unique.


SOME FACTOR MODELS FOR PERSONALITY


Factor theory has provided the only
rigorous models of any consequence
that have been proposed to make intelligible
the many facets of the phenomena
of individual differences. I
shall mention three of those models
because they are especially relevant to
this paper.³ A multidimensional model
is used to represent personality in
general. A hierarchical model represents
interrelationships among traits
within typical individuals. A matrix
model represents systematic relationships
among factors within a certain
domain of behavior.


A Dimensional Model 


The most common, basic model is
a multidimensional Euclidean space,
each dimension of which represents a
unique trait. A unique trait can be
discovered as a common factor by
factor analysis. To the extent that an
individual’s personality can be accounted
for in terms of a limited
number of common factors, each person
can be represented as a point in
n-dimensional space.

Fig. 1. Example of a dimensional model
involving three unique traits, or common
factors, A, B, and C, showing characteristic
point positions of two individuals, P and Q,
with respect to the three personality traits.

³ Still other models have been presented
by Guttman (1954) and Lazarsfeld (1954).

Figure 1 illustrates this kind of
model, showing only three dimensions,
for obvious reasons. Each axis is a
linear dimension representing a unique
trait along which individuals have
characteristic trait positions. Let us
say that axes A, B, and C represent
the disparate traits of gregariousness,
meticulousness, and emotional stability,
with their positive extreme qualities
at the labeled ends of the axes.
So far as these three traits go, Individual
P is described quantitatively
by his projections on the three axes.
His three projections define a point,
which is his characteristic position in
this three-dimensional space. The
point for Person P tells us that he
is a bit below average in gregariousness
and that he is very meticulous
and also moderately strong in emotional
stability, the average for each
trait being at the origin.

Person Q has another combination
of three projections, defining another
point location in the same space. In
n-dimensional space each person has n
different projections defining a point
in that n-dimensional space. The fact
that he probably changes his trait positions
somewhat from time to time need
not concern us in this discussion.
Such changes are the reason for the
expression “characteristic position”
used above. If it is difficult to think
of a space of a large number of dimensions,
one can think instead of a
profile chart, in which the dimensions
are laid side by side.
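
The dimensional model can be made concrete with a short sketch. The trait names come from the example in the text, but the numerical scores are invented for illustration, and the distance function assumes orthogonal axes, which the text treats as incidental rather than required.

```python
from math import sqrt

# Trait axes A, B, and C, named after the example in the text.
TRAITS = ("gregariousness", "meticulousness", "emotional_stability")

# Illustrative standard scores with the population average at the origin.
# P's values are invented to match the verbal description (a bit below
# average, very high, moderately high); Q's values are arbitrary.
P = (-0.3, 1.8, 0.9)
Q = (1.2, -0.5, 0.1)


def profile(point, traits=TRAITS):
    """Lay the dimensions side by side: one (trait, score) pair per axis."""
    if len(point) != len(traits):
        raise ValueError("one projection is needed per trait axis")
    return dict(zip(traits, point))


def distance(a, b):
    """Euclidean distance between two persons (assumes orthogonal axes)."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


if __name__ == "__main__":
    for name, point in (("P", P), ("Q", Q)):
        print(name, profile(point))
    print("distance between P and Q:", round(distance(P, Q), 3))
```

The same `profile` function works for any number of dimensions, which is the sense in which a profile chart substitutes for a space that cannot be drawn.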

The fact that the axes shown in
Figure 1 are orthogonal (at right
angles) is incidental. Actually, factorists
generally agree that the best-representing
axes more often than not
depart from right angles. They do
not agree as to how the angular
separations shall be determined empirically.


A Hierarchical Model

Although the dimensional model
just seen provides for the description
of individuals, it is better as a rep- 
resentation of variables within popula- 
tions than as a picture of personality 
structures within individuals. There 
is also the circumstance that factor 
traits are of different degrees of gen- 
erality, in the sense that some are 
related to wide ranges of behavior and 
some to narrower ranges. 

A brief example will illustrate. Let
us say that Student Z declined an
opportunity to cheat while taking a
certain examination. This is a specific
action; there is no necessary implication
concerning Z’s traits from this
one observable event. Over a run of
20 consecutive examinations, card
games, and purchases Z also passed
up 18 opportunities to cheat. We may
infer that over this range of behavior
Z has a rather high position on a trait
of resistance to cheating. So far as
we know from these observations, the
trait is not a very broad or general
one. Further observations show that
Z’s conduct is similar in situations
that offer opportunities for deceiving 
and stealing. From this information 
we may conclude that he is strong on 
a more general trait that we may 
call “honesty.” But Z has an even 
more general tendency toward ethical 
conduct of other varieties, which justi- 
fies our placing him above the average 
on a still broader trait of “strength 
of character.” Furthermore, he ex- 
hibits an even more general restraint 
and self-control that goes beyond be- 
havior that has ethical implications, 
suggesting a still broader disposition 
or syndrome. 

Because of the appearance of traits
at different levels of generality, a few 
writers feel the need of a second model 
to supplement the first (Burt, 1949; 
Eysenck, 1951; Guilford, 1959a; Vernon,
1950). This is a hierarchical
model, which serves to relate the factorial
dimensions logically to one another
and to a single personality.
Figure 2 is an example of a segment 
of such a model. In the illustration 
of traits just given, the particular acts 
of resistance, in whatever area of be- 
havior—cheating, deceiving, or steal- 
ing—are at the specific-action level. 
Traits of cheating, or of deceiving,
or of stealing may be regarded as
being at the “hexis” level. A hexis
is about the same as a habit but the
term “hexis” is preferred because it
is not so committed to the idea that
traits are determined by learning only.
A trait of honesty, for which there is
factor-analytical evidence (Guilford,
1959a), may be considered to be at the
primary-trait level. Strength of character
and general self-restraint would
be considered syndrome types at two
still higher levels.

Fig. 2. Segment of a hierarchical model representing an area of personality, with three
levels of traits of differing generality and the specific-action level. [Diagram not
reproduced; its levels, from top to bottom, are labeled type level, primary-trait level,
hexis level, and specific-action level.]

Factor analysis can be applied at
various levels in the hierarchy of traits.
From information concerning intercorrelations
of measured specific actions
we can derive conclusions regarding
the existence of unique traits
at the hexis level. From information
concerning intercorrelations of trait
positions of individuals for traits at
the hexis level, we can derive conclusions
regarding the existence of unique
traits at the primary-trait level; and
so on, for higher levels. Probably
most of the psychologically meaningful
factors thus far discovered have been
at the primary-trait level.

Many of the apparent disagreements
in results from different analysts
should be attributed to the fact that
they are analyzing at different levels
in the hierarchy. The kind of factors
one obtains depends upon the level
or levels at which he obtains his information
concerning individuals. In
some analyses, factors of different
levels are undoubtedly confused. From
the standpoint of clarity of results the
best general strategy would have been
to start with information from the
lower levels and work upward. Before
we can collect clear evidence regarding
status of individuals on unique
traits at a certain level, we need to
know what are the unique traits at
that level.
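
The idea of working upward from the lower levels can be sketched in a few lines. This is only an illustration of the bookkeeping, assuming simple averaging at each step; it is not a factor analysis, and it presupposes that the grouping of hexis-level traits under a primary trait is already known. Only the cheating record (18 of 20 opportunities resisted) comes from the text; the other records and all function names are hypothetical.

```python
from statistics import mean

# Specific-action level: 1 = opportunity resisted, 0 = not resisted.
# Only the cheating record (18 of 20) is taken from the text; the other
# two records are invented for illustration.
ACTIONS = {
    "resistance to cheating": [1] * 18 + [0] * 2,
    "resistance to deceiving": [1] * 9 + [0] * 1,
    "resistance to stealing": [1] * 10,
}

# Hexis-level traits grouped under the primary trait they are assumed to share.
PRIMARY_TRAITS = {"honesty": list(ACTIONS)}


def hexis_positions(actions):
    """Summarize each run of specific actions as a proportion (hexis level)."""
    return {hexis: mean(record) for hexis, record in actions.items()}


def primary_positions(hexis_scores, grouping):
    """Average the hexis-level positions that belong to each primary trait."""
    return {
        primary: mean(hexis_scores[h] for h in members)
        for primary, members in grouping.items()
    }


if __name__ == "__main__":
    hexis = hexis_positions(ACTIONS)
    print(hexis)
    print(primary_positions(hexis, PRIMARY_TRAITS))
```

In practice the grouping itself is what an analysis at each level would have to establish, which is why the text insists that the unique traits at a level be known before individuals are scored on them.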


The Matrix Type of Model

The third model is of recent origin 
and comes about from attempts to 
discern logical interrelationships among 
the known factors. Since the factors 
concerned are probably at the pri- 
mary-trait level, the resulting models 
apply within that level. The results 
at the primary-trait level reveal organ- 
izations of the factors that suggest re- 
lations to higher levels. Hence the 
general outcome of this kind of model 
building may serve as guiding hy- 
potheses concerning the completion of 
the hierarchical picture of personality. 
The best evidence concerning inter- 
relationships of the levels will be em- 
pirical information regarding inter- 
correlations of factors. As yet we have 
no adequate experimental procedures 
for estimating these correlations. 

Attempts have been made to classify 
the known factors in certain areas of 
personality, with some success in a 
few of them, including the psycho- 
motor abilities, the intellectual abilities, 








6 J. P. GuiILrorp 


the temperament traits, and some of 
the simpler traits of pathology (Guil- 
ford, 1959a). The typical outcome is 
a matrix of factors that places them 
in columns and rows according to their 
common properties. That is, the factors
of each row have something in
common, and so do the factors of each
column. In the case of the intellectual
abilities, of which about 55 are known 
at the present time, a third dimension 
is necessary. They have been organ- 
ized in a solid figure, known as the 
structure of intellect, which is shown 
in Figure 3 (see Guilford, 1959b). 

Fig. 3. Cubical model of the structure
of intellect, representing categories of primary
abilities with respect to three modes
of variation.

For our present purposes, we need
to note only the general features of the
model of the structure of intellect.
First, the factors are classified according
to the kind of operation performed:
cognition, memory, divergent
production, convergent production,
and evaluation. Second, the factors
are classified according to the kind of
content or material on which the operations
are performed: figural, symbolic,
semantic, and behavioral. Of the four
content categories we might say that
the first represents concrete intelligence,
the next two represent two
varieties of abstract intelligence, and
the fourth represents social intelligence.
The latter has been included
on a purely theoretical basis; no factors
of social intelligence are as yet
known. The third basis of classification
is in terms of the kinds of products
resulting from the operations on
the contents: units, classes, relations,
systems, transformations, and implications.
Theoretically, there should be
a different primary intellectual ability
in each cell of Figure 3, in other
words, about 120 (5 operations × 4
contents × 6 products) instead of the 55
presently recognized.
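
The count of cells follows directly from the three classifications, as the following sketch shows. The category names are those given in the text; the helper function and the constant for known factors are illustrative only, and the text does not say which particular cells the 55 known factors occupy.

```python
from itertools import product

OPERATIONS = ("cognition", "memory", "divergent production",
              "convergent production", "evaluation")
CONTENTS = ("figural", "symbolic", "semantic", "behavioral")
PRODUCTS = ("units", "classes", "relations", "systems",
            "transformations", "implications")

# One hypothesized primary ability per (operation, content, product) cell.
CELLS = list(product(OPERATIONS, CONTENTS, PRODUCTS))

# "About 55" factors are reported as known; their cell positions are not
# listed in this passage, so only the count is used here.
KNOWN_FACTORS = 55


def cells_with_content(content):
    """Return every cell of the model that involves the given content."""
    return [cell for cell in CELLS if cell[1] == content]


if __name__ == "__main__":
    print("cells in the model:", len(CELLS))
    print("cells not yet matched to a known factor:", len(CELLS) - KNOWN_FACTORS)
    print("behavioral (social intelligence) cells:",
          len(cells_with_content("behavioral")))
```

The behavioral slice accounts for 30 of the cells, all of them hypothetical at the time of writing, since no factors of social intelligence were yet known.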

We shall have occasion to refer back
to the structure of intellect, for I shall
draw heavily upon the principles that
it demonstrates and upon the implications
drawn from it in connection with
discussion of general psychological
theory. This is so, not only because
intellect looms large among the things
to be accounted for but also because
our knowledge from the factorial approach
is extensive in this area of
personality.


FACTOR ANALYSIS AS A TOOL
IN SCIENCE

Factor analysis is a potent instru- 
ment for extracting information from 
data, but it has no magical power to 
reveal anything regarding information 
that is not inherent in the data. The 
scientist who uses the method for the 
discovery of psychological information 
should start out by asking certain 
questions before he collects his data. 
In other words, a factor-analytic study 
must be carefully planned, with clearly 
stated hypotheses to be tested, if one 
is to use the method effectively. 

The more experimental features the 
investigator can bring into his plan- 
ning the better. This practice reduces 
ambiguity in the interpretation of results.
In evaluating the scientific use
of factor analysis, therefore, we need
to be clear as to the kinds of hypotheses 
that can be set up, the ways in which 
those hypotheses are tested, the kinds 
of variations of conditions introduced, 
and the kinds of controls exerted. 
Many psychologists are probably un- 
aware that these experimental features 
are possible in factor-analytical in- 
vestigations, perhaps because in publi- 
cations regarding the method so much 
attention is given to computational 
steps and so little to experimental uses 
of it. 

A Typical Factorial Study 

Let us illustrate an experimental 
design for a factorial investigation by 
reference to a study of intellectual 
abilities. Suppose we have selected for 
study the area of problem solving. 
Suppose that we had no prior in- 
formation regarding primary abilities 
involved in problem solving. Our 
preliminary thinking about problem 
solving would first probably suggest 
two alternative hypotheses. One 
would be that there is a single, uni- 
versal ability to solve problems and 
the alternative would be that there 
are a number of distinctly different 
abilities involved. If these were the 
only hypotheses that we wished to 
test in our first analysis, we would 
simply select a list of tests, each of 
which presents problems of a different 
type. The types would be varied con- 
siderably, covering a great range. The 
analysis should give us the answer as 
to whether individual differences in 
success in solving these many different 
kinds of problems can be accounted 
for on the basis of one common fac- 
tor or whether we are led by the 
results to reject this idea. 
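
One classical way to examine the single-factor hypothesis, not described in this paper but associated with Spearman's work, is the tetrad-difference criterion: if one common factor accounts for all the correlations, every tetrad difference among four tests vanishes. The sketch below applies it to correlation matrices built from invented loadings. With real data the differences would never be exactly zero because of sampling error, so this is an illustration of the logic, not a significance test.

```python
from itertools import combinations


def correlations_from_loadings(loadings):
    """Build the correlation matrix implied by orthogonal common factors.

    loadings[i][k] is the loading of test i on factor k. Off-diagonal
    correlations are sums of products of loadings; the diagonal is 1.
    """
    n = len(loadings)
    return [
        [1.0 if i == j else sum(a * b for a, b in zip(loadings[i], loadings[j]))
         for j in range(n)]
        for i in range(n)
    ]


def max_tetrad_difference(r):
    """Largest absolute tetrad difference over all sets of four tests."""
    worst = 0.0
    for i, j, k, l in combinations(range(len(r)), 4):
        tetrads = (
            r[i][j] * r[k][l] - r[i][k] * r[j][l],
            r[i][j] * r[k][l] - r[i][l] * r[j][k],
            r[i][k] * r[j][l] - r[i][l] * r[j][k],
        )
        worst = max(worst, *(abs(t) for t in tetrads))
    return worst


# Invented loadings: four tests on one common factor ...
ONE_FACTOR = [[0.8], [0.7], [0.6], [0.5]]
# ... versus four tests that divide between two common factors.
TWO_FACTORS = [[0.8, 0.0], [0.7, 0.1], [0.1, 0.7], [0.0, 0.6]]

if __name__ == "__main__":
    for label, loadings in (("one factor", ONE_FACTOR),
                            ("two factors", TWO_FACTORS)):
        r = correlations_from_loadings(loadings)
        print(label, "-> largest tetrad difference:",
              round(max_tetrad_difference(r), 4))
```

Tetrad differences near zero are compatible with a single common factor; clearly nonzero differences lead toward the multiple-factor alternative.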

Let us say that the results indicate 
the acceptance of the multiple-factor 
hypothesis to account for problem 
solving ability. The next study would
normally be directed toward answering
the question of how many factors
are needed and the question of the 
properties of each factor. The hy- 
potheses set up in the second investiga- 
tion would probably involve a predic- 
tion that a certain number of factors 
is needed and that each expected fac- 
tor has certain properties. The in- 
vestigator would be aided considerably 
in this hypothesis formation from the 
way in which the tests were classified 
among the factors in the first analysis. 
But there might be ambiguities re- 
garding the properties of each factor. 
Hypothesized factor Q might be ex- 
pected to have properties a, d, and f, 
or it might have properties c, d, and f, 
or even properties a, c, and g. From 
the hypothesized properties arise ideas 
as to the kinds of tests that will be 
needed to measure the factor. Ex- 
perimental tests are accordingly con- 
structed, differing systematically along 
the lines of the supposed properties, 
with the hope that when a hypothesized 
factor does come out in the analysis, 
tests with one kind of specification will 
be found related to it and tests with 
the other specifications will not. In 
this manner we hope to obtain clarifica- 
tion of the nature of a factor. 


Statistical Tests and Factor Analysis 


At this point the rigorously inclined 
investigator is quick to remind us that 
there are no adequate statistical tests 
that can be applied, either to help us 
decide how many common factors ob- 
tain in the particular analysis or 
whether factor loadings are signifi- 
cantly different from zero or from one 
another. There is no use denying this 
state of affairs. It is also true that 
there is some degree of looseness con- 
nected with rotational procedures, 
which also contributes to the uncertainty
of conclusions. Conclusions
must therefore be made on a permissive
rather than a compulsory basis,
in spite of the fact that an accepted
rotational solution appears to be compulsory.⁴
To be completely objective,
we can only say that yes, we may
tolerate the idea that there is a unique
psychological trait having such and
such properties, in line with preliminary
hypotheses, or we may say that
no, we cannot reasonably do so. In the
latter event we may conclude that the
factor has other properties not suspected
before and a new analysis is
called for.

The reaction of some psychologists
to this state of affairs may be to reject
the results from factor analysis
completely. To such a person it can
be said that much more important to
a scientific psychologist than statistical
tests are psychological ideas. Sciences
generally, including psychology, originally
developed and went a long way
without the aid of statistical tests.
The lack of statistical tests is not fatal,
but the lack of ideas would be. This
is not to say that we should not wish
to have both if we can. But there
should be no prohibition against applying
a method that seems fruitful,
while waiting for statistical tests to
be developed.⁵ I suggest that we let
the results of factor analysis speak for
themselves and that they be judged on
the basis of how much they contribute
to psychological understanding, prediction,
and control.


⁴ A number of proposals have been made
for rotation according to rigorously defined
criteria by analytical methods. Although
some of them are very attractive in
principle, they usually fail to achieve fully
satisfactory solutions from an intuitive point
of view. The difficulty is that the selection
of experimental variables must be nearly
ideal in order to achieve the “right” solution
from an interpretive standpoint.

⁵ Some progress is being made with
regard to statistical tests of certain aspects
of factor-analytical results (see Maxwell,
1959).


Experimental Controls in Factor 
Analysis 


To return to other experimental 
features of a factorial study, what con- 
ditions are systematically varied? The 
most conspicuous kind of variation 
is in the types of tests, or other ex- 
perimental variables, used. For the 
most part these variations are qualita- 
tive. From test to test we find varia- 
tions in kind of material—figures, 
pictured objects, letters, numbers, and 
words being the most common vari- 
eties in printed form. We find varia- 
tions in item format—multiple-choice, 
matching, completion, and their deriva- 
tives. We find variations in instruc- 
tion as to what is to be done and how 
it is to be done. In some of our more 
recent analyses we have introduced a 
few quantitative variations, such as 
the number of restrictions of a certain 
kind, the number of responses called 
for per item, and difficulty level of 
tests with the same kind of items. In 
connection with the quantitative varia- 
tions we sometimes predicted that the 
factor loading for a certain factor 
would rise systematically and that for
another factor would fall systematically,
or that there would be an
intermediate optimal level for the variation
concerned. A great deal has to
be known about the nature of a factor
before this much refinement in experimental
variation can be effectively
introduced.

Certain conditions are at least partially
controlled, or should be. The
selection of the population of individuals is
important. In the study of intellectual
abilities, there should be relative homogeneity
in age, education, sex, and
general intellectual level. Ideally, one
should like to equate individuals on
all other factors than those under investigation,
but this would be prohibitive
in terms of effort, and fortunately
it is not essential. We can
tolerate incomplete controls, as must 
be true of any experiment, allowing 
the effects to go into the error com- 
ponents. Factor analysis has a way 
of segregating error variance from 
the variances in which we are in- 
terested, not perfectly, but within 
practical limits. So long as the non- 
error variance is of sufficient potency, 
we can obtain a fair picture of the 
common factors. 
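
The separation of error variance from the variance of interest can be illustrated with the conventional decomposition of a test's unit variance into common, specific, and error parts. The sketch assumes orthogonal common factors and a known reliability for the test; the function name, the loadings, and the reliability value are all hypothetical.

```python
def variance_components(loadings, reliability):
    """Split a test's unit variance into common, specific, and error parts.

    loadings: the test's loadings on orthogonal common factors.
    reliability: proportion of the test's variance that is not error
                 (assumed known for this illustration).
    """
    communality = sum(a * a for a in loadings)  # variance due to common factors
    if not 0.0 <= communality <= reliability <= 1.0:
        raise ValueError("need 0 <= communality <= reliability <= 1")
    return {
        "common": communality,
        "specific": reliability - communality,
        "error": 1.0 - reliability,
    }


if __name__ == "__main__":
    # Invented example: loadings of .6 and .5 on two factors, reliability .85.
    parts = variance_components([0.6, 0.5], reliability=0.85)
    for name, value in parts.items():
        print(f"{name}: {value:.2f}")
```

Imperfect controls enlarge the error share, but so long as the common share remains substantial the common factors can still be identified.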

FROM FACTORS TO FUNCTIONS

Our next need in this discussion is 
to establish a logical bridge between 
factors and general psychological the- 
ory. Present psychological theory has 
arisen mainly in connection with 
the traditional experimental approach, 
hence in constructing the bridge we 
should profit by giving some attention 
to a comparison of that approach with 
the factor-analytical approach, in cer- 
tain respects. 


Individual Differences vs. Functioning
Individuals

The main operational difference is 
that the traditional experimentalist 
focuses his interest upon stimulus- 
response relationships. The stimulus- 
response model is basic to his thinking 
and to his planning of investigations. 
The factorist, on the other hand, di- 
rects his attention primarily upon re- 
sponses and the concomitances among 
responses. From the intercorrelation 
of response values the factorist looks 
for signs of traits, not for stimulus-response
relations. Traits are properties
of individuals. To the student
of personality, their determination of
behavior is just as real a phenomenon
as the determination by stimuli. Some
experimentalists are now recognizing
this general principle, as indicated in
studies of the relation of characteristic
anxiety levels to various measured effects
in behavior. In such an experiment,
one independent variable is not
a stimulus variable or even a transi- 
tory personal variable such as a mental 
set. It is individual differences on 
an enduring attribute, in other words 
a trait. 

The key to the bridge, therefore, is 
the concept of “trait.” In the context 
of personality theory, a trait is any 
relatively enduring way in which one 
person differs from others. On a 
scalable trait, which can be represented 
by a straight line, each person has a 
characteristic position. If individuals 
have different positions on a common 
scale, the scale represents some quality 
or property that each person possesses 
to some degree, in common with other 
persons. If the quality is a unique 
one, such as may be discovered by 
factor analysis, it is some significant 
component of the individual’s con- 
stitution. 

There is nothing in this line of 
reasoning to force us to any conclusion 
as to the fundamental nature of this 
component. It might be a unique skill, 
a unique motive, or a unique attitude 
of some kind. It might be dependent 
upon some particular organic structure 
or upon some combination of structures 
functioning together. There is nothing 
in ordinary factor-analytic results to 
tell us about the origins of a factor 
or to give us sufficient basis for its 
automatic classification. 

Traits and Functions 

If we consider the primary abilities 
represented in the structure of intellect 
(Figure 3), however, and ask our- 
selves what kinds of components they 
are, we obtain a little help. Examina- 
tion of the list of intellectual abilities 
tells us that represented in one partic- 
ular cell is an ability to cognize (dis- 
cover, know, or rediscover) semantic 
systems and that this is distinctly
different from an ability to evaluate the
logical soundness of conclusions in- 
volving relations between symbols. If 
each person has a characteristic level 
of ability to perform in each of these 
two respects, he certainly is perform- 
ing in these two ways. In other words, 
we may say that he has functions of 
these two types. The primary intel- 
lectual abilities may therefore be con- 
ceived as ways of functioning within 
individuals as well as ways in which 
individuals differ from one another. 
Thus, by the study of how persons 
differ we also discover how they are 
alike. 

Traits other than abilities are not 
so readily conceived as functions, but 
it is possible to see that some of them 
are the directions that the functioning 
individual takes (in the case of motiva- 
tional traits) and others are connected 
with the manner in which the function- 
ing proceeds (in the case of tempera- 
mental traits). As in the case of 
abilities, other primary traits should 
give us ideas about the functioning of 
the individual as a going concern. 
Information regarding the factors and 
their properties should therefore help 
us to build up a picture of the 
behaving organism, and hence con- 
tribute to psychological theory. 


SOME IMPLICATIONS OF FACTORS FOR
PSYCHOLOGICAL THEORY 


It has just been suggested that from 
the psychologically meaningful factors 
that we know, and from their inter- 
relationships, we can make progress 
toward the general description of be- 
havior of individuals and account for 
the determination of behavior by in- 
ternal sources. The primary traits 
that are recognized as needs, interests, 
and attitudes, provide potentially use- 
ful concepts for understanding the 
varieties of general motives operating 
within the individual. Forty or more
years ago it was common for psychologists
to draw up lists of instincts.
In later years this kind of exercise has 
been replaced by a somewhat less com- 
mon effort to draw up a list of funda- 
mental human motives. McDougall, 
for example, took this kind of ob- 
jective very seriously. If the point 
of view adopted in psychology calls 
for such an analysis of motivation into 
component urges, propensities, needs 
—call them what you will—factor anal- 
ysis offers a very natural empirical 
method for arriving at such a goal. 





Information, Operations, and Products 


From the intellectual factors as 
organized in the structure of intellect, 
some of the most significant implica- 
tions for psychological theory may be 
seen at this time. The first general 
principle is that we should conceive of 
a human being as an instrument or 
agent whose psychological purpose is 
to deal with information. In saying 
this, it is necessary to interpret in- 
formation in a very broad sense. For 
our present purposes, “information” 
may be defined as anything that the 
individual discriminates. He need only 
show that he discriminates or can do 
so by means of differential reactions 
of a sufficiently consistent character. 

This principle regarding informa- 
tion and the definition of information 
do not follow directly from the struc- 
ture of intellect, but they are strongly 
suggested by it. The structure of intel- 
lect does indicate that the individual 
cognizes, which means that he dis- 
covers or rediscovers or recognizes in- 
formation. The information may be 
in figural form, which is a form that 
can be perceived through the senses. 
It may be in symbolic form, which 
may involve letters, digits, or other 
conventional signs. It may be in the 
form of meanings, ideas, or concepts, 
or what I have called semantic content.
If the behavioral column is appropriately
included in the structure of
intellect, the information may be in 
terms of such things as the intentions, 
feelings, and attitudes of other individ- 
uals and of ourselves. 

The individual not only cognizes all 
these kinds of things, he also retains 
information for future use. According 
to the structure of intellect, for every 
cognitive ability there should be a cor- 
responding ability to retain informa- 
tion. Eight such memory factors are 
known. The individual also generates 
new information from given informa- 
tion, in accordance with the two cate- 
gories of productive operations—diver- 
gent and convergent. Finally, the in- 
dividual evaluates information, whether 
it is merely cognized or whether re- 
membered or produced from other in- 
formation. He decides whether the 
information is correct, adequate, suitable,
and so on. A negative decision
provides the occasion for new activity
involving the same kinds of operations,
with the individual probably arriving 
at new products. 

The kinds of products achieved are 
as indicated by the six categories of 
factors known at present. Units are 
relatively segregated items of informa- 
tion and they are sometimes articula- 
tions or integrations of parts. The 
latter characteristic is best seen in the 
symbolic column of factors, in con- 
nection with which units may be in the 
form of syllables, words, or number 
combinations. By virtue of common 
properties among the units, classes can 
be formed. Units also enter into re- 
lations with one another, for example, 
relations such as larger than, attached
to, or inside of. One might think that 
systems are merely complex sets of 
relations, but analysis shows that abili- 
ties needed to cope with them are dif- 
ferent from the abilities needed to cope 
with relations. Transformations include
changes of various kinds, for example,
changes in spatial order or
arrangement, in letter combinations,
and in the uses or functions of objects.
Implications are in the form of extrapolations
from given information, such
as extensions of the information in 
space or in time. Thus, the six kinds 
of products seem to pertain to varie- 
ties of knowledge when knowledge is 
reduced to elemental terms. 
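
The cross-classification just described can be laid out as a small data structure. The sketch below is only an illustration: the category names follow the text, but the tuple layout and the function name are assumptions made here, and the behavioral content category is included on the article's own proviso that it is appropriately part of the structure.

```python
from itertools import product

# Category names follow the text; the layout itself is an illustrative assumption.
OPERATIONS = ("cognition", "memory", "divergent production",
              "convergent production", "evaluation")
CONTENTS = ("figural", "symbolic", "semantic", "behavioral")
PRODUCTS = ("units", "classes", "relations", "systems",
            "transformations", "implications")


def structure_of_intellect_cells():
    """Return every (operation, content, product) combination as a tuple."""
    return list(product(OPERATIONS, CONTENTS, PRODUCTS))


cells = structure_of_intellect_cells()
print(len(cells))  # 5 operations * 4 contents * 6 products = 120

# The ability to cognize semantic systems, mentioned earlier, is one such cell.
print(("cognition", "semantic", "systems") in cells)  # True
```

Each tuple marks a place where a factor may be expected; the listing says nothing about which cells have actually been demonstrated.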


An Informational Theory of Learning 


A psychological system should pro- 
vide the principles by which the facts 
of various areas may be ordered and 
comprehended. As an example of 
how the factorial view would account 
for an area of behavior, let us consider 
the subject of learning. Each point of 
view should have something to say 
regarding this most important subject. 
What kind of theory of learning would 
be consistent with the known out- 
comes of factor analysis or would be 
implied by this information?

Taking our cue from the structure 
of intellect, we find that the theory of 
learning suggested is in the general 
category of cognitive theories. Fur- 
thermore, from the list of cognitive 
factors we see the classes into which 
such cognitions fall. Learning, then, is 
essentially discovery. The particular 
discovery may be achieved completely 
and suddenly, an instance of learning 
that has been called “insight,” or it 
may be achieved in a series of ap- 
proximations. The discovery may be 
in any of the six forms recognized as 
products in the structure of intellect. 
The achieved cognition may be an 
acquaintance with a new unit, the for- 
mation of a new class, the formation 
of a new relationship or a new system, 
the awareness of a transformation, or 
the extension to new implications. 

It is possible to give an associative 
interpretation to some of these events
but not all. Perhaps the most obvious
type of product that may be interpreted
as an association is that of relation.
But when we recognize that two things
bear some relationship to one another, 
it is not just that when one comes we 
think of the other. The relation itself 
is something additional. It may be a 
difference along some continuum, as 
when we say that X is taller than Y, 
P is older than Q, or J is softer than 
K. The relation might be in terms of 
spatial or temporal position. Or it 
might be in terms of opposition, friend- 
ship, status, or of whole and part. 
One particular relation may be of very 
general importance, namely, the re- 
lation to the effect that “this belongs 
with that” merely because the two 
things occur in temporal or spatial 
proximity. There may be no other 
basis, for example, in terms of prop- 
erties of the two things, for justifying 
the formation of a relationship. 

It is possible to apply the concept 
of association in connection with the 
cognition of classes. Classes can be 
formed because units have something
in common. We could say that Aris- 
totle’s concept of association by simi- 
larity applies. But being aware that a 
class exists and knowing a class con- 
cept would seem to go well beyond a 
mere association by virtue of the simi- 
larity of two things. 

The more modern interpretation of 
class recognition brings such an event 
under the concept of “equivalence of 
stimuli.” But there seems to be a
genuine psychological difference between
failing to discriminate stimuli,
on the one hand, and cognizing that
discriminated stimuli have properties
in common and therefore belong in
the same class, on the other. We could
think of all such events, of course, as
belonging on a continuum of lower to
higher levels of cognition of classes.
The implication of this discussion is
that class recognition, however primitive
or sophisticated, is still a unique
kind of psychological event. 

A third possible place for applying 
the associative principle is in con- 
nection with the product of implica- 
tion. One of the natural conditions 
for developing a cognition of implication
would seem to be temporal sequences
of stimulations. It seems
likely that what some learning theorists
refer to as “expectations” come
within the category of implications.

In connection with the other three
kinds of products (units, systems, and
transformations) the association principle
seems entirely inappropriate.
Wundt and others failed in their attempts
to account for perceived objects
in terms of association. An important
natural consequence was the
development of Gestalt psychology.
Units, systems, and transformations
all pertain to things that have totality
properties that defy associative interpretations.
As a matter of fact, it can
be maintained that even in the products
of relations, classes, and implications,
totality properties are also involved.
Two things in relation constitute
a kind of unitary structure, as
Spearman pointed out. Classes are
conceived groups of units and to each
group a single class concept is applied.
The emergence of a class idea is
hardly to be accounted for as the formation
of an association. In connection
with implications we often have
continuities of various kinds. Since it
is possible to conceive of totality properties
for all six kinds of products, it
is small wonder that the Gestalt psychologists
felt considerable confidence
in applying their principles to all kinds
of mental functioning, with complete
discard of the principle of association.

As a particular example of human
learning, let us take the case of memorizing
a series of nonsense syllables,
so popular traditionally in the laboratory.
From the factorial point of
view, each syllable is a symbolic unit,
and the learner becomes more and 
more familiar with it as exposures are 
repeated. In learning to associate each 
syllable with the one following in the 
list, the learner, we may say, is seeking 
for and perhaps discovering a relation 
(or relations) where none existed be- 
fore; a relationship may be formed in- 
volving two units of information. Two 
syllables offer very little in the way 
of properties that can be utilized for 
this purpose, as compared with words. 
But each pair of syllables does offer 
some possibility of the formation of 
some kind of relationship. 

Another aspect may be that the 
learner is acquiring new expectations 
or implications. This may be in- 
dicated by the fact that forward con- 
nections are stronger than backward 
associations. We may also hypothesize 
that the learner is developing a new 
system of units. At least two primary 
abilities that we know indicate that 
order is an important kind of system. 
Some indication that a system is being 
formed can be seen in what are com- 
monly called remote forward associa- 
tions and backward associations. The 
beginning and end units of the series
of syllables seem to serve as anchoring
points for the whole system and hence
have much to do with the relative
ease of learning units in different
positions in the series.


This line of reasoning suggests that 
in memorizing a list of syllables sev- 
eral of the primary abilities come into 
play, including the cognition of sym- 
bolic units, of symbolic relations, of 
symbolic systems, and of symbolic im- 
plications. Since the testing of prog- 
ress in learning involves lapse of time, 
there is probably also involvement of 
the corresponding memory factors. If 
the series were composed of figures or
of words, the corresponding cognitive
and memory factors in the figural and 
semantic columns, respectively, should 
be involved. Still other intellectual 
resources may come into play, depend- 
ing upon the strategy of the learner. 
The hypotheses concerning the roles 
of various common factors in a par- 
ticular learning task can be tested by 
factor analysis of scores obtained at 
different stages of learning along with 
sufficient marker tests for the factors 
probably involved. Fleishman and 
Hempel (1954, 1955) have already 
conducted studies of this kind. 
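
The logic of including marker tests can be illustrated with a small calculation. The sketch below is not the procedure of any cited study. It assumes standardized scores and uncorrelated common factors, in which case the correlation implied between two scores is the sum of the products of their loadings on the same factors. The function name, the factor labels, and every loading are hypothetical values chosen only for illustration.

```python
def implied_correlation(loadings_a, loadings_b):
    """Correlation implied between two standardized scores by their loadings
    on the same uncorrelated common factors (sum of loading products)."""
    if loadings_a.keys() != loadings_b.keys():
        raise ValueError("both scores must be described on the same factors")
    return sum(loadings_a[name] * loadings_b[name] for name in loadings_a)


# Hypothetical loadings of learning scores taken early and late in practice.
early_trials = {"symbolic units": 0.5, "symbolic systems": 0.2}
late_trials = {"symbolic units": 0.2, "symbolic systems": 0.5}

# Hypothetical marker test that measures only the symbolic-units factor.
units_marker = {"symbolic units": 0.7, "symbolic systems": 0.0}

print(round(implied_correlation(early_trials, units_marker), 2))  # 0.35
print(round(implied_correlation(late_trials, units_marker), 2))   # 0.14
```

Under these made-up values the marker correlates more strongly with early scores than with late ones, which is the kind of pattern a factor analysis across stages of learning would be expected to reveal if the role of a factor changed with practice.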

Reinforcement

Most theories of learning have some 
provision to account for the fact that 
practiced changes in behavior have 
some degree of permanence, in other 
words, some concept to account for 
what is called “fixation.” The con- 
cepts usually invoked for this purpose 
are some form of a law of effect or of 
reinforcement. In an informational 
theory of learning based upon the 
structure of intellect, the key to fixa- 
tion is to be found in the category of 
evaluation. In this connection, feed- 
back information is very important. 

Thorndike seemed to favor the in- 
terpretation that effect is in the form 
of a confirming reaction, a conception 
that seems close to that of evaluation. 
The idea of confirming reaction is not 
comprehensive, however, because it 
does not give significance to failure of 
confirmation. In stressing the positive 
aspect of evaluation, Thorndike was 
reflecting the experimental findings, 
as he thought, that only positive effect 
seemed to change the strength of 
learned connections. He did recognize 
that a failure leads to new learning 
activity, which is equivalent to saying 
that an evaluation of failure is the oc- 
casion for new learning effort. 

The conception of an evaluative
aspect to learning is broad enough to
embrace most current interpretations
of effect. When primary reinforcement
is equated to drive reduction, we
can say that the organism is using 
feedback information regarding his in- 
ternal condition. Secondary reinforce- 
ment may be said to involve implica- 
tions cognized by the organism. Lest 
these statements raise objections that 
there cannot be unconscious cognitions 
and evaluations, as a sort of contradic- 
tion of terms, let us refer back to the 
definition of information that was given 
earlier—information is that which the 
organism discriminates. The conscious-unconscious
distinction is therefore
irrelevant. 

The evaluative view of effect just 
expressed suggests the extension of 
the application of the concept of information
to the phenomena of motivation
and emotion generally. This extension
leads to some new ways of
looking at old problems. Both motives
and emotions, including feelings, can 
be regarded as information concerning 
the internal conditions of the organism;
concerning its needs and its states of 
well being or ill being. 

In this connection we need to rec- 
ognize another property of informa- 
tion. When motives become at all 
strong, there is a sense of urgency 
about them and about the accompany- 
ing emotional aspects. This suggests 
the need for the concept of “loading.” 
We can say that information carries 
different degrees of loading or weight. 
Feelings of pleasure and of displeasure 
are in themselves information that is 
more or less loaded, positively or 
negatively. There is increasing evi- 
dence that in themselves these feelings 
can be reinforcing (Miller, 1955). 
They are not infallible: information 
and evaluation may sometimes go 
against them because they are outweighed
by other information, but they
nevertheless can be regarded as kinds
of evaluative information. In this con- 
nection, it may be of more than in- 
cidental significance that in the work 
of Osgood and others on the dimensions
of meaning, the pleasant-unpleasant
or evaluative dimension seems
to be the strongest and most universal
(Osgood, 1957).⁶

Another thought on “loading” would 
be that the basis for it is to be found 
in the behavioral column of the struc- 
ture of intellect. Information regard- 
ing our own internal conditions and 
our relation to external situations is 
evaluated as being vital to some de- 
gree. The degree of recognized vital- 
ness determines the loading or is the 
loading. We are thus brought to the 
conclusion that loading is a matter of 
evaluation and that it may apply not 
only to behavioral products but quite 
generally. 




Transfer 

The general problem of transfer of 
learning takes on a different com- 
plexion in view of our information 
concerning factors. G. A. Ferguson 
(1954) has proposed the unique theory 
that factors of ability are produced by 
the fact that transfer of learning takes 
place along the lines of the common 
factors. It has long been recognized 
that psychological similarity of tasks 
is a key to the direction and extent of 
transfer. Similarity of tasks depends 
upon common kinds of content, opera- 
tions, and products, or of combinations 
of these features. We interpret com- 
mon factors in terms of the common 
properties of the tests that indicate 
them. 

6 It is also significant that J. P. Seward
(1958) has recently assembled considerable 
evidence in support of the cognitive inter- 
pretation of learning and also that there is 
evidence pointing toward need satisfaction 
that arises merely from acquiring new 
cognitions. 

























Ferguson’s theory does not necessarily
mean that all the effects of learning
are general. It is possible to
deduce from factor theory, as applied 
to abilities, the principle that each in- 
stance of learning has both general 
and specific components. Fleishman 
and Hempel (1954, 1955) have pro- 
vided evidence to support this prin- 
ciple from more than one investigation. 
Each time, they found a specific com- 
ponent, whose relative importance for 
individual differences systematically
increased with practice, in addition to
the common-factor components each 
of which tended to undergo some sys- 
tematic changes in importance. The 
common factors were identifiable with 
those found in testing apart from the 
learning experiment. 
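
The principle of general and specific components can be made concrete with the usual partition of a score's variance. The sketch below is not taken from the Fleishman and Hempel reports. It assumes a standardized score, uncorrelated common factors, and a known reliability, so that common variance is the sum of squared loadings, specific variance is reliability minus common variance, and error variance is one minus reliability. The function name and all numbers are hypothetical.

```python
def variance_components(common_loadings, reliability):
    """Split the unit variance of a standardized score into common,
    specific, and error proportions (uncorrelated factors assumed)."""
    common = sum(a * a for a in common_loadings)
    if not 0.0 <= common <= reliability <= 1.0:
        raise ValueError("need 0 <= common variance <= reliability <= 1")
    return {
        "common": common,
        "specific": reliability - common,
        "error": 1.0 - reliability,
    }


# Hypothetical loadings and reliabilities for one task at two stages of practice.
early_stage = variance_components([0.6, 0.4], reliability=0.80)
late_stage = variance_components([0.4, 0.3], reliability=0.85)

for label, parts in (("early", early_stage), ("late", late_stage)):
    print(label, {name: round(value, 2) for name, value in parts.items()})
```

With these invented figures the specific proportion rises from early to late practice while the common proportion falls, which mirrors in form, though not in actual values, the trend described above.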

Although the Fleishman and Hempel 
experiments give us no information 
regarding progressive improvements 
in any of the common factors, they 
demonstrate that common factors do 
play roles in learning and they find no 
evidence of the existence of a unique, 
general learning-ability factor. These 
and other factorial learning studies 
should nail in its coffin for all time 
the notion that there is a single, gen- 
eral learning ability. Abilities to learn 
are apparently at least as numerous 
as there are intellectual factors. The 
writer often feels that psychologists 
would do well to banish the term 
“learning” entirely and instead talk 
about changes in behavior, their con- 
ditions and consequences. 

Factors in Problem Solving 

The same fate is in store for the 
notion that there is a single problem- 
solving ability. As I have pointed out 
elsewhere (Guilford, 1954), the vari- 
ety of problems is enormous. Each 
type of problem, like each type of 
task, is presented in the form of a 
certain kind of content, it calls for
its own kinds of operations, and certain
kinds of products are needed in
order to achieve a solution. A problem
exists for an individual when to
cope with a given situation he has no 
adequate, previously learned and re- 
membered device ready to deal with 
the situation. From this definition we 
see that problem solving is as broad 
as behavior itself. It is likely, there- 
fore, to tax any of the resources of 
intellect that may be pertinent.

Each type of problem should draw
upon its own pattern of resources in 
the average person in a certain popula- 
tion. The importance of each factor 
for individual differences in problem- 
solving scores can be indicated by a 
factor analysis of those scores along 
with scores from appropriate marker 
tests of the factors. In our Aptitudes 
Project at the University of Southern 
California,⁷ we have just completed
this type of study for one type of 
problem. Five factors appeared to be 
of some importance: sensitivity to prob- 
lems, verbal comprehension, ideational 
fluency, originality, and conceptual 
foresight. The conditions, as rep- 
resented by the problem tasks and 
the intellectual status of the subjects 
and their strategies, were evidently 
right for emphasizing these five factors. 


7 Under Contract Nonr-228(20) with the
Office of Naval Research, monitored by the
Personnel and Training Branch. This
particular study was under the direct supervision
of Philip R. Merrifield (Merrifield,
Guilford, Christensen, & Frick, 1960).


Motor and Sensorimotor Aspects of
Learning


Many readers will recognize that
the account of learning is limited in
a number of ways, because only the
intellectual factors have been considered,
and it needs certain supplementations.
The theory of learning
presented is not considered a complete
one by any means. There are questions
of how sequences of cognitions
and actions are developed and how 
complex motor patterns are organized 
as a consequence of activity. Two 
individuals who have the same cogni- 
tion may not do the same thing or do 
it in the same manner. Adequate
motor responses are not always available
on demand and new ones must
often be developed.

From a cognitive point of view,
sensorimotor learning can be conceived
as the formation of cognition-response
sequences instead of stimulus-response
sequences. If we knew the
individual's cognitions, we should be
better able to predict his responses
than we can from knowledge of the
stimulating conditions. He responds
overtly to his cognitions rather than to
the experimenter’s “real” world. For 
a long time the Gestalters have been 
attempting to convince us of the valid- 
ity of this principle. If the learner’s 
cognitions nearly coincide with the 
experimenter’s, of course, the experi- 
menter’s knowledge of stimulating con- 
ditions is sufficient for practical pur- 
poses. But where there are notable 
differences between the two views, it 
should pay to attempt to learn what we 
can regarding the learner’s cognitions. 
The learner’s evaluations would be
other important sources of information
regarding what he will do and with
what energy.

The other problem, that of develop- 
ment of motor patterns, can possibly 
be approached profitably by giving at- 
tention to kinesthetic cognitions as a 
subject for study. The learner has a
flood of information from himself as
well as from his external environment,
information of which the experimenter
is unaware. A systematic
factorial exploration of the individual's
use of his kinesthetic content should be
fruitful. It should tell us whether
information from this general source
also comes in the form of the six kinds 
of products and what sorts of things 
these products are in the kinesthetic 
domain. The individual’s information 
concerning his responding equipment, 
its potentialities and limitations, from 
this source, should have an important 
bearing upon what he does with that 
equipment. 

RELATIONS TO OTHER POINTS
OF VIEW


Some Major Issues 


Before comparing the point of view 
presented here with particular schools 
of thought, let us consider its position 
on a few of the major issues that 
divide those schools. One of the issues 
is sometimes called “mentalism,” and 
it is concerned with the question 
whether conscious experience is ac- 
cepted as a phenomenon and whether 
it plays a significant role in a system. 
My impression is that a system de- 
veloped from factorial information and 
thinking need not take sides on this 
issue; the issue is not so very relevant.
Actually, a completely objective sys- 
tem could be worked out, as one can 
see from the way in which the ap- 
plication to phenomena of learning and 
problem solving needed no reference 
to conscious experience or to con- 
cepts necessarily implying it. But if 
conscious experience is admitted as a 
necessary problem for psychologists, 
the factorial approach has much to 
offer in determining the dimensions of 
that experience and hence in finding 
descriptive concepts for it. 

The assertion that the factorial view 
can generate an objective system needs 
additional defense, for the terms
“cognition” and “thinking,” particularly,
arose in the context of mentalistic psychology.
In spite of their origin, these
terms, and others used in connection
with the structure of intellect, can be
given entirely objective interpretations, 
for the reason that they have observ- 
able referents. The referents are the 
test materials used in factorial in- 
vestigations. The factors, and in turn 
the factor categories, are interpreted 
in terms of the kinds of test content, 
the kinds of operations demanded by 
the instructions and the items, and the 
kinds of responses needed to make
good scores.

In one sense, factor analysis is
analogous to the atomic physicist’s
cloud-chamber procedures. The physicist
draws a conclusion regarding
things that he cannot observe, from
observed events remotely connected
with them. From such scattered
sources of observation he builds up a
model of the atom and of its nucleus.
From test scores and the ways in
which they are interrelated, the factor
analyst constructs conceptions of human
nature. The one seems as objective
as the other.

The issue of associationism has already
been touched upon in discussing
the nature of learning and it will be
recalled that the outcome was to reject
association as a principle that
would account for the products of
information. There are important
areas of intellectual functioning in
which development of understanding
has been seriously handicapped by our
staking so much on one principle. Because
the principle has served in so
many places, in spite of objections and
necessary qualifications and omissions,
overreliance on the principle has con- 
tinued. When the further restriction 
of recognizing only stimulus-response 
associations is imposed, psychology is 
further limited. This brings us to 
another issue. 

A third general issue may be called 
“centralism vs. peripheralism.” The 
extreme peripheralist not only restricts
his observations to stimuli and
responses, but also chooses to limit 
himself to descriptive concepts that 
pertain only to those events. The cen- 
tralist, on the other hand, although 
recognizing that we are limited to ob- 
servations of stimuli and responses, 
does not hesitate to attempt to infer 
something regarding processes that 
take place between those events. Hull 
and others have seen the need for 
inferring a few intervening events or 
conditions (called intervening vari- 
ables) but they attempt to get along 
with a bare minimum. 

It is my conviction that a psychol- 
ogy that restricts itself entirely to the 
task of correlating stimuli and re- 
sponses and that avoids attempts to 
build up a picture of what goes on 
between them will fall far short of 
complete success. The most significant 
things happen between stimuli and 
responses. It is these things that 
make known stimulus-response cor- 
relations most significant. In predict- 
ing behavior we need to know about 
internal conditions, including traits, as 
well as about the external situation. 
The same is true concerning the con- 
trol of behavior. The factor-analytic 
approach is aimed at a conceptualiza- 
tion of psychological properties of a 
central nature. Whether it can supply 
all the kinds of concepts that are needed 
for this purpose remains to be seen. 


Comparisons with Some Major Schools 


From the preceding discussion it 
should be clear that a psychology based 
upon factor analysis agrees with be- 
haviorism in achieving an objective 
science but it finds the behaviorist’s
exclusive dependence upon the association
principle to be a serious deficiency,
and behaviorism’s restriction
to peripheralism also seriously limiting. 

A factor psychology finds in the 
Gestalt point of view quite a number
of congenial features. As pointed out
earlier, it finds the Gestalt emphasis
upon totalities to be useful as a replacement
for associationism. It finds,
however, that Gestalt psychology failed 
to carry through its own program. 
In part this might have been due to 
its rapt admiration for totalities and 
to its aversion to analysis. Gestalt 
psychologists, by further elaboration in 
their thinking and by experimental 
approaches might have achieved a set 
of concepts such as those that describe 
the structure of intellect, but perhaps 
for lack of a method like factor anal- 
ysis they would not have succeeded. 
Factor theory, with its additive equa- 
tions, is no doubt anathema to Gestalt 
psychologists if they pay any attention 
at all to that theory. In spite of the 
summative equations of factor theory, 
factor analysis has not destroyed total- 
ities but has discovered what many of 
them are like, for example, the prod- 
ucts of information mentioned earlier. 


In terms of general views and ways 
of thinking, the kind of factorial psy- 
chology of which I have been speaking 
comes closest to the traditional point
of view of functionalism. The easy
transition from factors to functions is 
the simplest indication of this similarity 
in type of fundamental thinking. Func- 
tional psychology never became very 
systematic, but through factor-analytic 
findings it could do so. One difference 
in the two views is that the function- 
alist has often emphasized Darwinian 
concepts as explanatory principles. 
This kind of explanation is not essential 
to a factorial point of view. 

And what of the “dynamic” psy- 
chologies? Among this group we gen- 
erally recognize the psychoanalysts, 
hormic psychologists, and the Lewinian 
psychologists. The strong intellectu- 
alistic bias of the discussion in this 
paper, which took its start from the 
structure of intellect, should not mean
that factor analysis has nothing to
offer regarding dynamic aspects of be- 
havior. I have skirted this aspect, 
suggesting that motivation and emotion 
can be related to the domain of intellect 
in connection with evaluation. An- 
other important point of contact would 
be the category of hypothesized be- 
havioral abilities. 

A more direct approach to dynamic 
aspects of behavior, however, is through 
the analysis of factors of motivation 
and emotion. Very much has already 
been done in this direction and Cattell 
(1957), in particular, has generated 
considerable theory from factorial in- 
formation concerning needs, interests, 
and attitudes. The integration of such 
theory with that derived from intel- 
lectual factors is something still to be 
done. 


SUMMARY 


In explaining how factor analysis 
and its findings can contribute to gen- 
eral psychological theory, the writer 
described three models, derivable from
factor theory or from the results of
factor analysis, that apply to personality.
Personality is defined as a matter
of individual differences with respect
to traits. One model is a space
of m dimensions, each dimension rep- 
resenting a unique trait that is dis- 
coverable by factor analysis. Another 
model that emphasizes the structure 
of a typical personality represents 
traits at several distinct levels of gen- 
erality, one of the levels being devoted 
to primary traits. A third model 
arises from the logical classification of 
known primary traits in columns and 
rows to form two- and three-dimen- 
sional matrices. 

It is also necessary to discuss the 
adequacy of factor analysis as a method 
adaptable to the discovery of psychological
concepts having theoretical significance.
Used in an appropriate type
of experimental design and applied to
the appropriate kinds of experimental 
variables, factor analysis can be a 
powerful tool for the meaningful dis- 
crimination among traits. Hypothesis 
testing of a kind is possible, in spite of 
inadequate statistical tests of signifi- 
cance. 

Although factor analysis starts with 
data regarding individual differences 
in behavior and comes out with pri- 
mary traits that also refer to modes 
of individual differences, we find it 
possible to deduce corresponding ways 
of functioning in behaving organisms. 
In this manner we see a bridge be- 
tween factors and functions. The 
classes of factors, as derived from the 
third kind of model, suggest general 
concepts that seem to have theoretical 
significance for psychology. 

From the organization of intellectual 
factors we derive the impression that 
the organism is an agent that acquires 
information, retains it, uses it in generating
new information, and evaluates
information in connection with any of
these steps. Learning is the achieving
of information, and the various intel- 
lectual abilities are abilities to learn. 
Similarly, problem solving rests upon 
certain combinations of primary abil- 
ities, depending upon the circum- 
stances. Reinforcement or effect in 
learning is accounted for by the opera- 
tions of evaluation. Drive reduction 
and effect supply information as bases 
for evaluation, along with other feed- 
back information. Motives and emo- 
tions, too, may be brought under the 
heading of information that concerns 
the organism’s internal states. 

The concept of association is very 
inadequate in accounting for the kinds 
of products with which the intellectual 
abilities are concerned. The products, 
such as units, systems, transformations, 
and implications, have about them a 
totality aspect that calls for something
like the concept of Gestalt. The factorial
approach offers a form of analysis
that should be useful to the Gestalt
psychologist as well as to the func- 
tional psychologist. 

Like behaviorism, the proposed fac- 
tor psychology applies a completely 
objective approach and yields objective 
descriptions. Unlike behaviorism, it 
aims at the construction of a model of 
what goes on between stimulus and 
response rather than a model empha- 
sizing stimulus-response correlations. 
Also unlike behaviorism it finds no use 
for the concept of association and
would substitute the concept of cogni- 
tion-response sequences for that of 
stimulus-response associations. 

From a systematic standpoint, factor 
psychology is best described as a type 
of functional psychology, without any 
close affiliation with the traditional 
functional school. With further ex- 
tensions of factor-analytical efforts in 
the area of dynamics of behavior, we 
should be able to develop a more 
complete system than it is possible to 
present today. 


REFERENCES 


Burt, C. The structure of mind: A review of the results of
factor analysis. Brit. J. Psychol., 1949, 19, 100-111, 176-199.

Cattell, R. B. Personality and motivation structure and
measurement. New York: World Book, 1957.

Eysenck, H. J. The organization of personality. J. Pers.,
1951, 20, 101-117.

Ferguson, G. A. On learning and human ability. Canad. J.
Psychol., 1954, 8, 95-112.

Fleishman, E. A., & Hempel, W. E., Jr. Changes in factor
structure of a complex psychomotor test as a function of
practice. Psychometrika, 1954, 19, 239-252.

Fleishman, E. A., & Hempel, W. E., Jr. The relation between
abilities and improvement with practice in a visual
discrimination reaction time task. J. exp. Psychol., 1955,
49, 301-312.

Guilford, J. P. Factors in problem solving. ARTC Instructors
J., 1954, 4, 197-204.

Guilford, J. P. Personality. New York: McGraw-Hill, 1959. (a)

Guilford, J. P. Three faces of intellect. Amer. Psychologist,
1959, 14, 469-479. (b)

Guttman, L. A new approach to factor analysis: The radex. In
P. F. Lazarsfeld (Ed.), Mathematical thinking in the social
sciences. Glencoe: Free Press, 1954.

Lazarsfeld, P. F. A conceptual introduction to latent
structure analysis. In P. F. Lazarsfeld (Ed.), Mathematical
thinking in the social sciences. Glencoe: Free Press, 1954.

Maxwell, A. E. Statistical methods in factor analysis.
Psychol. Bull., 1959, 56, 228-235.

Merrifield, P. R., Guilford, J. P., Christensen, P. R., &
Frick, J. W. A factor-analytic study of problem solving.
U. Southern Calif. Rep. Psychol. Lab., 1960, No. 22.

Miller, N. E. Shortcomings of food consumption as a measure
of hunger: Results from other behavioral techniques. Ann.
N. Y. Acad. Sci., 1955, 63, 141-143.

Osgood, C. E., Suci, G. J., & Tannenbaum, P. H. The
measurement of meaning. Urbana: Univer. Illinois Press, 1957.

Seward, J. P. Basic issues in learning theory. In G. S.
Seward & J. P. Seward (Eds.), Current psychological issues.
New York: Holt, 1958.

Spearman, C. The nature of ‘intelligence’ and principles of
cognition. New York: Macmillan, 1923.

Spearman, C. Abilities of man. New York: Macmillan, 1927.

Vernon, P. E. The structure of abilities. New York: Wiley,
1950.


( Received December ya 1959) 











Psychological Review 


1961, Vol. 68, No. 1, 21-32 


PERSONALITY THEORY AND THE PROBLEM
OF STABILITY AND CHANGE IN
INDIVIDUAL BEHAVIOR:
AN INTERPERSONAL APPROACH 1


PAUL F. SECORD AND CARL W. BACKMAN


University of Nevada 


Clinical psychologists, personality theorists, and most other
students of individual behavior commonly assume that a person
gradually forms characteristic behavior patterns which become
more and more resistant to change with the passage of time.
These patterns are usually thought of as reflections of
intra-individual structures or mechanisms: e.g., habits, needs,
cognitive structures, or traits. Because these intrapersonal
determinants of behavior are, in effect, accepted as givens,
observable consistency over a period of time tends to remain
unexplained: it is simply a manifestation of these structures.
From this point of view, a person behaves as he does because
of what he is.

1 This investigation was supported in part by a grant (M-1892)
from the National Institute of Mental Health, United States
Public Health Service.

Thus, Allport (1937), Cattell (1950), and Eysenck (1953)
attribute enduring structure to personality, a structure in
which the concept of trait is central. Murray (1938) conceives
of personality as an organizing and integrating force having a
neural locus in the organism. In spite of his field emphasis,
Lewin (1935) conceives of personality structure as characterized
by certain differentiations and articulations of regions, which
correspond to aspects of personal character. Murphy (1947)
includes physiological dispositions, canalizations, conditioned
responses, and cognitive and perceptual habits as components of
personality. Two cognitive orientations are those of Rogers
(1951), and Combs and Snygg (1959), who conceive of behavior as
organized around and guided by a relatively enduring
self-concept. Finally, almost all forms of personality
assessment are based upon the assumption that the individual
undergoing appraisal has a stable personality structure which
the investigator is attempting to describe, in part or in whole.
Murphy (1947) has summed up this attitude succinctly. He states:


The laboratory psychologist and the clinician 
have both conceived the individual as a sys- 
tem of events and tendencies carried around 
within the skin of the individual subject. 
They have both assumed that his delinquencies
or his triumphs, his interests or nervous
maladjustments, result fully and simply from
structured dispositions within him (p. 877). 


For all of these theorists, then, per- 
sonality has a certain stability of struc- 
ture which in turn maintains continuity 
in behavior over time. This is not to 
assert that behavior is viewed by these 
theorists as unchanging or unrespon- 
sive to varying stimulus situations. On 
the contrary, many of these personolo- 
gists often give careful attention to 
stimulus situations as modifiers of be- 
havior. But, sometimes explicitly and 
sometimes implicitly, they do appear to 
make the assumption that when be- 
havioral stability occurs, it is a function 
of stability in personality structure. A 
closely related assumption is that personality
structure has a strong resistance
to change; in the absence of special
change-inducing forces or conditions, it 
remains constant. 

These assumptions have two consequences. The first is that
continuity in individual behavior is not a problem to be
solved; it is simply a natural outcome of the formation of
stable structure. The second is that either behavioral change
is not given systematic attention, or change is explained
independently of stability. Whereas behavioral stability is
explained by constancy of structure, change tends to be
explained by environmental forces and fortuitous circumstances.
A more parsimonious approach would be to account for both
stability and change by means of a single set of explanatory
principles. The following theoretical statement abandons the
assumption that internal stabilizing mechanisms are inherently
resistant to change and are the sole source of stability in
individual behavior.

The present paper places the locus of stability and change in
the interaction process rather than in intrapersonal structures.
At the same time, the importance of habit, cognitive structure,
and similar concepts is not denied, but these mechanisms do not
by themselves establish necessary conditions for behavioral
stability. Stable patterns in the interaction process, which
constitute the essential condition for consistency in individual
behavior, may be thought of as deriving from two major sources.
One source is the cultural, normative, and institutional forces
which stabilize the behavior of persons with whom a particular
individual interacts, as well as his behavior toward them. With
this important source the present paper is not concerned; social
psychologists, sociologists, anthropologists, and others have
analyzed these forces in extensive detail.


The other source of stability, although it has been given some
attention by social psychologists, has been relatively neglected
by students of individual behavior. It has been customary to
think of two classes of behavioral determinants: cultural and
normative forces, and those forces stemming from the individual.
The present paper attempts to identify a third class of
determinants, which have their locus neither in the individual
nor the culture, but in the interaction process itself. In a
general sense this third class may be characterized as the
tendencies of the individual and the persons with whom he
interacts to shape the interaction process according to certain
requirements, i.e., they strive to produce certain patterned
relations. As will be seen, the principles governing this
activity are truly interpersonal; they require as much attention
to the behavior of the other as they do to the behavior of the
individual, and it cannot be said that one or the other is the
sole locus of cause.

Because of limitations of space and in the interests of clarity,
the theory will be presented without attempting to specifically
examine the empirical evidence, but those familiar with the
literature will recognize that much supporting evidence exists.
For definitive support, however, an extensive research program
guided by the theory needs to be carried out.


SOURCES OF STABILITY IN 
INDIVIDUAL BEHAVIOR


Definition of Interpersonal Matrix and 
Matrix Congruency 


The locus of behavioral stability and 
change lies in the interpersonal matrix, 
which has three components: an aspect
of the self-concept of the subject (S), 
S’s interpretation of those elements of 
his behavior related to that aspect, and 
S’s perception of related aspects of the
other person (O) with whom he is
interacting. An interpersonal matrix 
is a recurring functional relation be- 
tween these three components. The 
behavior of a particular person is epi- 
sodic in character, in that he shifts 
from one matrix to another over vary- 
ing periods of time. A matrix is re- 
ferred to as the same if, on two or more 
occasions, the same self-component, S- 
behavior, and O-component are present 
in the same functional relation. The 
present theoretical treatment focuses 
mainly on the discussion of single ma- 
trices. Also, only aspects of self and 
behavior which are valued by S are 
dealt with, and these are assumed to 
maintain relatively constant values. 

S strives to achieve congruency 2 among the components of the
matrix. Congruency is a cognitive phenomenon: i.e., each
component enters into a state of congruency only as a perceptual
cognitive experience on the part of S. All three components of
the matrix are in a state of congruency when the behaviors of S
and O imply definitions of self congruent with relevant aspects
of the self-concept. In a nurturant-dependent relation, for
example, a dependent S has conceptions of self containing
elements of inadequacy and need for support. His behavior is
consistent with these aspects of cognitive structure, for it is
hesitant and compliant, reaffirming his self-definition. Since
the behavior of O is perceived as providing assistance and as
reassuring and decisive, it implies a matching definition of S’s
self-concept and behavior. In the above example, S’s
interpretation of his behavior is congruent with a relevant
aspect of the self, S’s perception of O is congruent with the
self-aspect, and S’s interpretation of his behavior is congruent
with his perception of O. Thus, any two components of a matrix
may be congruent with each other, with the third either
congruent with both of them, or not congruent with either.

2 Congruency as used here has some similarities to the
following: Abelson’s (1958) resolution of belief dilemmas,
Cartwright and Harary’s (1956) concept of structure balance,
Festinger’s (1957) theory of cognitive dissonance, Heider’s
(1958) theory of balance, and Osgood and Tannenbaum’s (1955)
concept of congruence.
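
The three-component structure just described can be made concrete with a small sketch. The Python below is an illustration only, under stated assumptions: the class name `Matrix`, its boolean fields, and the helper `odd_component` are hypothetical and are not part of the theory's own notation. Each field records whether S experiences one pair of components as congruent.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Matrix:
    """One interpersonal matrix as S experiences it (illustrative only).

    Each flag records whether S perceives a pair of components as congruent:
    an aspect of self (S), S's interpretation of his behavior (B), and
    S's perception of the other person (O).
    """
    self_behavior: bool   # S congruent with B
    self_other: bool      # S congruent with O
    behavior_other: bool  # B congruent with O

    def is_congruent(self) -> bool:
        # The matrix as a whole is congruent only when every pair is.
        return self.self_behavior and self.self_other and self.behavior_other

    def odd_component(self) -> Optional[str]:
        """Name the component congruent with neither of the other two,
        when exactly one pair is congruent; otherwise return None."""
        pairs = (self.self_behavior, self.self_other, self.behavior_other)
        if sum(pairs) != 1:
            return None
        if self.self_behavior:
            return "O"
        if self.self_other:
            return "B"
        return "S"


# The nurturant-dependent example: all three pairs are congruent.
dependent = Matrix(self_behavior=True, self_other=True, behavior_other=True)
print(dependent.is_congruent())   # True

# Self and behavior agree, but O's perceived behavior fits neither.
strained = Matrix(self_behavior=True, self_other=False, behavior_other=False)
print(strained.odd_component())   # O
```

Representing the relations pairwise, rather than as a single flag, keeps visible which component stands apart when a matrix is not fully congruent.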

Implications for self-definitions may take three forms: S may
perceive O’s behavior as directly confirming a component of
self; O’s behavior may enable S to behave in ways that would
confirm a component of self; or O’s behavior may (by comparison)
lead other Os to confirm a component of S’s self-concept.
Examples of each form are:


An S who regards himself as mature and 
responsible perceives that Os respect him for 
these characteristics. 

An S who regards himself as nurturant 
encounters an O in need of help; this allows 
him to behave toward O in a manner which 
supports his nurturant aspect of self. 

A girl who regards herself as popular and 
well-liked keeps company with an unpopular 
girl; Os are viewed by her as judging her 
favorably by contrast. 

Reality-oriented vs. cognitively distorted matrices. A matrix is
reality-oriented if the S-behavior component and O-component are
correctly interpreted or perceived. On the other hand, a matrix
which involves misinterpretation of S’s behavior or
misperception of O is not reality-oriented; it involves some
form of cognitive distortion. It is obvious that congruency
would sometimes be achieved by means of cognitive distortion.
Presumably a congruency achieved by this means would be less
stable than a reality-oriented congruency. At least this would
be true if there were forces tending to produce accurate
interpretations and perceptions. On the other hand,
incongruencies which might be made congruent simply by
correcting cognitive distortions to fit reality are not regarded
as particularly stable or significant, and thus are not further
discussed.

The tendency to achieve and main- 
tain congruent states. A number of 
general principles may be presented 
concerning the maintenance of con- 
gruent states. In many instances these 
will be elaborated later. 

1. S tends to repeat and perpetuate
those interpersonal relations which
were previously characterized by congruency.

2. An S involved in a matrix which 
is not in a state of congruency will tend 
to modify the matrix in the direction 
of greater congruency. 

3. The engagement of S and O in 
congruent interaction develops mutual 
affect toward each other, which tends 
to perpetuate the interaction. 

4. Because of the tendency to estab- 
lish congruent matrices, S gradually 
builds up an increasingly greater reper- 
tory of such matrices. 

5. The more the O-component of a 
congruent matrix is valued, the greater 
the tendency of the matrix to be per- 
petuated. 

Relations among matrices and the perpetuation of congruent
states. A given matrix may be considered relevant to those
matrices which contain one or more of the same or similar
components as the given matrix. For example, for an S who
considers himself to be a kind person, there will be many
S-behaviors and O-behaviors which are perceived as forming
congruencies and incongruencies with this aspect of self. The
various matrices formed by distinctive S-behaviors and
O-behaviors in connection with the aspect of self, kindness, are
all considered relevant to each other.

A relevant matrix may be either supportive or nonsupportive with
respect to a given matrix. Supportiveness between two matrices
is determined by whether or not the component(s) common to them
has the same relation (one of congruency or one of incongruency)
to the other unique components in each matrix. If the common
component has the same relation in both matrices, they are said
to be supportive.


For example, suppose a woman regards herself as unattractive to
men. Her mother continually calls her attention to the fact that
women in her family were seldom noted for their attractiveness.
Our S also finds that a man she likes at work avoids her as much
as possible. In the one matrix, unattractiveness is congruent
with her mother’s direct definition of her as unattractive and,
in the other, the avoidance behavior of the man leads her to
perceive his behavior as defining her as unattractive. Thus, one
matrix supports the other. On the other hand, if her mother
perceived many good features in her appearance and stressed
these, and the man professed admiration for her, both matrices
would be incongruent with her notion of self as unattractive.
These two matrices would also support each other: they agree in
denying an aspect of self. These might be termed incongruent
supportive matrices, and the former type, congruent supportive
matrices.
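
The supportiveness rule can be sketched in a few lines. In the Python below, the names `RelevantMatrix` and `supportive` are hypothetical, and reducing each matrix to one shared aspect of self plus a single congruent/incongruent flag is a simplifying assumption made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RelevantMatrix:
    """A matrix reduced to what the supportiveness rule needs (assumed form)."""
    common_component: str  # e.g. the shared aspect of self
    congruent: bool        # relation of that component to the unique components


def supportive(a: RelevantMatrix, b: RelevantMatrix) -> bool:
    """Two relevant matrices support each other when the common component
    stands in the same relation (congruent or incongruent) in both."""
    if a.common_component != b.common_component:
        raise ValueError("matrices share no component, so they are not relevant")
    return a.congruent == b.congruent


# The example of the woman who regards herself as unattractive to men.
mother = RelevantMatrix("unattractive", congruent=True)
coworker = RelevantMatrix("unattractive", congruent=True)
print(supportive(mother, coworker))          # True: congruent supportive

praising_mother = RelevantMatrix("unattractive", congruent=False)
admirer = RelevantMatrix("unattractive", congruent=False)
print(supportive(praising_mother, admirer))  # True: incongruent supportive

print(supportive(mother, admirer))           # False: nonsupportive
```

Note that supportiveness here depends only on agreement between the two matrices, not on whether either one is itself congruent.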


A statement may now be made with respect to the role of a
complex of matrices in perpetuating a given matrix. Matrices
vary with respect to centricality. The centricality of a matrix
is a function of the number of other matrices which stand in a
supportive relation to it, and the value of the O-components in
these matrices. The greater the centricality of a matrix, the
more resistant it is to change, and, should it change, the
greater the resultant shifts in other matrices.
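
The text says only that centricality is a function of two quantities and does not give its form. As a purely illustrative sketch, the Python below assumes an additive index: one term per supportive matrix, weighted by the value S places on that matrix's O-component. The function name `centricality`, the additive form, and the 0-to-1 value scale are all assumptions, not part of the theory.

```python
from typing import Iterable


def centricality(supporting_o_values: Iterable[float]) -> float:
    """Assumed additive index: one term per supportive matrix, weighted by
    the value S places on that matrix's O-component. The source states only
    that centricality depends on both quantities, not how they combine."""
    return sum(supporting_o_values)


# Hypothetical O-values on an arbitrary 0-to-1 scale.
few_valued_others = centricality([0.9, 0.8])
many_casual_others = centricality([0.2, 0.2, 0.2])

print(few_valued_others > many_casual_others)  # True
```

With non-negative values, this assumed form rises both with the number of supportive matrices and with the value of the Os involved, which is the only property the text requires; other monotone combinations would serve equally well.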


Interpersonal Processes Contributing
to Congruent States


Congruency is continually threatened by the changing nature of
interpersonal relations due to: (a) normative patterns of
change, which result in changes in the behavior of others
towards S, e.g., changes in the behavior of others as the child
grows older, or as a person changes occupations, gets married;
(b) fortuitous changes, e.g., death of a parent, loss of a
friend, induction into the military; and (c) the fact that O’s
means of establishing congruency for himself often create
incongruencies for S. A number of interpersonal processes may be
suggested which counter these pressures for change and which
perpetuate previous matrices.

The first five interpersonal processes 
restore congruency through some trans- 
formation of the O-component of the 
matrix:

1. Selective interaction with Os. S tends to maximize engagement
in congruent patterns of interpersonal behavior by selecting and
interacting with those Os whose behavior requires a minimum
change from previously congruent interpersonal situations in
which S has engaged. For example, if S regards himself as
especially intelligent, he tends to interact frequently with Os
who respect his intelligence or who allow him to exercise it, or
he interacts in such a way as to make himself appear intelligent
by comparison.

2. Selective evaluation of Os. S tends to maximize congruency by
altering the evaluation of selected Os in a positive or negative
direction, depending upon whether they are behaving congruently
or incongruently with certain aspects of self. The more an O is
valued by S, either positively or negatively, the more important
his role in maintaining congruency. This provides a means of
increasing the effects of congruency or reducing the effects of
incongruency. Thus, S tends to increase his liking for Os who
behave toward him in a congruent fashion, and to decrease his
liking for those who behave in an incongruent manner.

3. Selective comparison with aspects of O. S tends to maximize
congruency by selectively attending to behaviors of O which are
congruent, and selectively ignoring those which are incongruent.
For example, a patient in therapy particularly notes those
behaviors on the part of the therapist which confirm aspects of
his self-concept and behavior, and tends to ignore those which
are incongruent.

4. Evocation of congruent responses 
from O. S develops techniques for 
eliciting from O behavior which will 
be congruent with components of his 
self-concept and behavior. For exam- 
ple, a feminine woman plays a helpless 
role in order to elicit dominance from 
men. 

5. Misperception of O. S may mis- 
perceive O’s behavior so as to achieve 
congruency with aspects of his behavior 
and self-concept. Thus, an S has a 
high regard for himself as a lover and 
perceives other women as more inter- 
ested in him than they really are. 

The sixth and seventh interpersonal processes restore congruency
by a transformation of the S-behavior component of the matrix:

6. Selective behavior-matching. In interacting with a particular
O, S tends to maximize congruency by selecting from his total
behavioral repertory those behaviors which are most congruent
with his perception of O. This principle reflects William James’
(1890, Vol. 1, p. 294) notion that each person has as many
“social selves” as there are classes of persons whose opinion he
cares about.

7. Misinterpretation of own behavior. 
S may misinterpret his behavior so as 
to achieve maximum congruency with 
an aspect of his self-concept and his 
perception of O. For example, a duti- 
ful mother spanks her child because of 
anger and frustration, but convinces 
herself that she is doing it for the 
child’s own good.
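
The grouping of the seven processes by the matrix component they transform can be summarized as a lookup table. The sketch below is only an illustration of that grouping; the names `Component`, `PROCESSES`, and `processes_for` are hypothetical, and the short process labels paraphrase the numbered headings above.

```python
from enum import Enum
from typing import List


class Component(Enum):
    O = "O-component"
    S_BEHAVIOR = "S-behavior component"


# Process number -> (short label, component transformed), following the list.
PROCESSES = {
    1: ("selective interaction with Os", Component.O),
    2: ("selective evaluation of Os", Component.O),
    3: ("selective comparison with aspects of O", Component.O),
    4: ("evocation of congruent responses from O", Component.O),
    5: ("misperception of O", Component.O),
    6: ("selective behavior-matching", Component.S_BEHAVIOR),
    7: ("misinterpretation of own behavior", Component.S_BEHAVIOR),
}


def processes_for(component: Component) -> List[int]:
    """Return the numbers of the processes that transform the given component."""
    return [n for n, (_, target) in PROCESSES.items() if target is component]


print(processes_for(Component.O))           # [1, 2, 3, 4, 5]
print(processes_for(Component.S_BEHAVIOR))  # [6, 7]
```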






Certain combinations of these principles may sometimes operate
under a single rubric. For example, an individual may select a
social role which enables him to achieve maximum congruency
among the three components. This involves interaction with
selected Os who will engage in certain desired reciprocal
behavior, and also permits behaviors which validate the self.
Thus, a woman who regards herself as intelligent, independent,
and ambitious may reject the marital role in favor of a career
role. A second example is found in circular interaction systems.
In these, response evocation tends to become an activity mutual
to two persons: the actions of S give rise to behavior on the
part of O which in turn evokes further actions of the type
initially displayed by the individual. For example, aggressive
actions on the part of the mother, initiated in the attempt to
control or eliminate undesirable aggressive behavior in her
child, raise his frustration level, which in turn gives further
impetus to his tendency to respond in aggressive ways.


FORCES TOWARD CHANGE AND THE
RESOLUTION OF THESE FORCES


States of Incongruency and Their Consequences

Any theory of stability in self and behavior must also be able
to explain change. There are three steps leading to change: the
creation of an incongruency, the formation of a new congruent
matrix which involves a different component of self or behavior
from that existing prior to the change, and the adjustment of
relevant matrices which have been affected by the changes made
in resolving the incongruent matrix. As mentioned earlier,
incongruencies arise from several sources: from cultural,
normative, and institutional forces, from fortuitous factors,
and from O’s attempts to establish congruency for himself.
Incongruencies occur frequently, but are often followed by
restoration of a congruent matrix which does not involve a
change in self or behavior. Whether or not change occurs,
resolution of the newly formed incongruency occurs through
utilization of one or more of the interpersonal processes
previously discussed. Thus, a single set of interpersonal
processes is presumed to underlie both behavioral stability and
the direction of behavioral change.


Four types of incongruency are 
identified in the following discussion. 
Of considerable interest and impor- 
tance is the manner in which the vari- 
ous types of incongruency may be re- 
solved. As already noted, there are 
two general classes of resolution. One 
of these results in restoration of the 
original matrix, leaving self and be- 
havior unchanged (although cognitive 
distortions may occur), and the other 
leads to a new matrix in which self or 
behavior are changed. In order to 
avoid being tedious, only a brief, in- 
formal discussion of the forms of reso- 
lution will be presented. Table 1, 
however, presents in symbolic form a 
complete set of resolutions for the first 
three types of incongruency. The 
fourth is omitted since it is mainly of 
academic interest and is probably quite 
rare. 

Type I. The behavior of O is perceived
as incongruent with a component
of self and with S’s behavior,
the latter two elements being congruent
with each other.


For example, S, an alcoholic, who regards 
himself as a “weak,” inadequate person, 
neglects his family by going on extensive 
binges, implying a definition of self as in- 
adequate. O, his wife, behaves toward S 
as if he were capable of controlling his 
drinking. 

In this type of incongruency, S may employ any one of the first
five interpersonal processes to transform the incongruent O
component, and thereby maintain self and behavior unchanged.
Thus, he may reduce interaction with his wife, may devaluate her,
may avoid selective comparison by declaring her opinion
irrelevant, may evoke new responses from her, or may falsely
perceive her as trying to give him a pep talk.


TABLE 1

RESOLUTION OF BASIC TYPES OF INCONGRUENCY

Let S  represent the self-component of matrix
    B  represent S-behavior as a reality-oriented matrix component
    O  represent O-behavior as a reality-oriented matrix component
    B′ represent a matrix component based on misinterpretation of S-behavior
    O′ represent a matrix component based on misperception of O-behavior
    X̄  represent an incongruent form of a component X
    Xc represent a change in a component X

                               Type of Resolution

Type of          Maintenance    Change in      Change in     Change in
Incongruency     of Self and    Self-Concept   Behavior      Self-Concept
                 Behavior                                    and Behavior

I.   S B Ō       SB             ScB′O          Not           ScBcO
                 SB                            applicable
                 SBOc
                 SBOc
                 SBO′

II.  S̄ B O       SB′            [illegible]    SBc           Not
                 SB′                           SBc           applicable
                 SB′Oc                         SBcOc
                 SB′Oc                         SBcOc
                 SB′O′                         SBcO′

III. S B̄ O       SB′O           ScB            SBcO          Not
                                ScB                          applicable
                                ScBOc
                                ScBOc
                                ScBO′

Note. Resolutions involving O are listed in order of the five
interpersonal processes which apply to O: (a) reduced
interaction, SB; (b) devaluation, SB; (c) O-behavior selection,
SBOc; (d) response evocation, SBOc; and (e) misperception, SBO′.

Other forms of achieving congruency 
require a change in self or behavior. 
S may change both components, or he 
may change self but not behavior by 
misinterpreting his behavior. 

Type II. An aspect of self is in- 
congruent with S’s behavior and with 
the perceived behavior of O, the latter 
two elements being congruent with 
each other. 


For example, a woman behaving seductively 
gets matching responses from O, but regards 
herself as morally conservative. 

One form of resolution involves mainte- 
nance of self and behavior. Since behavior 
is incongruent, it would have to be misin- 
terpreted if self is to be maintained, and in 
addition, the five interpersonal processes 
which transform the incongruent O compo- 
nent would be employed here. For example, 
behavior is misinterpreted if S regards her 
behavior as casually flirtatious, or as “not 
me” because she had been drinking, and, 
with respect to O, S might reduce inter- 
action with him, regard his behavior as 
harmless flirtation, or she might make her 
own behavior irrelevant by defining him as 
a man who would try to seduce any woman, 
etc. 

A simple change in self would result in 
a congruent matrix. Or, finally, S might 
change her behavior and transform the O- 
component. 


Type III. S’s behavior is incongruent
with a component of self and
with the perceived behavior of O, the 
latter two elements being congruent 
with each other. 


For example, a wife regards herself as 
loving, but realizes that she is often hostile 
toward her husband. Her husband fails to 
recognize much of the hostility and behaves 
toward her as if he perceives her as loving. 

If, in this instance, S changes her self- 
concept to match her unchanged behavior, 
she may employ the five interpersonal proc- 
esses to transform the O-component. Thus, 
she may leave her husband, devaluate him, 
emphasize occasions on which he has been 
angry with her, evoke hostility from him, 
or think that he secretly does recognize her 
hostility. If she maintains self-concept and 
behavior, she must necessarily misinterpret 
her behavior in order to achieve congruency. 


Type IV. All three matrix com- 
ponents are incongruent with each 
other. Two general forms of resolu- 
tion are: (a) any two of the compo- 
nents may be changed to match the 
third; (b) all three may be modified
to achieve a new matrix congruency.
Type IV incongruency is thought to be
rare and is not discussed further.
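
The four types can be stated compactly. The following Python sketch is an illustration only and is not part of the theory as published. It assumes, hypothetically, that each matrix component can be reduced to a single label for the definition of self it implies, and that two components are congruent when their labels match. The function name classify_matrix and the labels are invented for the example.

```python
from typing import Hashable


def classify_matrix(self_def: Hashable, behavior_def: Hashable, other_def: Hashable) -> str:
    """Name the type of incongruency among the three matrix components.

    Each argument is the definition of self implied by one component:
    S (an aspect of the self-concept), B (S's behavior as S interprets it),
    and O (O's behavior as S perceives it). Equal labels count as congruent;
    this equality test is a simplifying assumption of the sketch.
    """
    s_b = self_def == behavior_def
    s_o = self_def == other_def
    b_o = behavior_def == other_def

    if s_b and s_o:
        return "congruent"   # all three components agree
    if s_b:
        return "Type I"      # O disagrees with S and B
    if b_o:
        return "Type II"     # S disagrees with B and O
    if s_o:
        return "Type III"    # B disagrees with S and O
    return "Type IV"         # all three disagree with each other


# The three examples from the text, with invented labels.
print(classify_matrix("inadequate", "inadequate", "capable"))    # the alcoholic and his wife
print(classify_matrix("conservative", "seductive", "seductive"))  # the woman behaving seductively
print(classify_matrix("loving", "hostile", "loving"))             # the wife who is often hostile
```

Run on the three examples in the text, the sketch labels them Type I, Type II, and Type III respectively. It says nothing about which resolution S will adopt; that depends on the factors taken up in the next section.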







Factors Governing the Probable Di- 
rection of Resolution 


The various forms of resolution of 
incongruent matrices are not equally 
probable. Which ones are more likely 
to occur is in part a matter for em- 
pirical study. In the meantime, how- 
ever, a number of factors relating to 
the probability of change in the various 
components may be identified. In the 
first place, O’s behavior toward S may 
be systematic and consistent because it 
stems from certain factors external to 
the dyad, such as role expectations;
e.g., O may be S’s therapist, or he may 
be a parent socializing his child. An 
obvious second consideration is the 
ease with which S may leave the dyad.
A patient may readily leave therapy, 
but a child cannot usually leave his 
family. A third condition pertains to 
the extent to which an individual’s con- 
gruent relations are reality-oriented, 
i.e., to what degree misinterpretation 
and misperception are required for congruency.
Matrix components which
are not reality-oriented tend to be less
stable. A fourth point is that the greater
the number and value of supportive congruent
matrices which are relevant to
a component, the more resistant is that
component to change. Finally, it may 
be postulated that the individual learns 
to utilize selectively the various modes 
of resolution. For a particular individual,
certain types of component-transformation
are more practiced and
are preferred to others.


DISCUSSION


An important difference between the
present approach and earlier ones
should be re-emphasized. Such traditional
concepts as cognitive structure,
self-concept, trait, and habit are products
of earlier social interaction. But
the assumption is not made that persons
strive to maintain either their
self-concept or various habitual behaviors
by virtue of some inherent
“gyroscopic” force residing in these
dispositions. Maintenance of intrapersonal
structure occurs only when
such maintenance is consistent with an
ongoing interaction process which is in
a state of congruency. That most individuals
do maintain intrapersonal
structure is a function of the fact that
the behavior of others toward the individuals
in question is normally overwhelmingly
consistent with such maintenance.
This fact has often been
overlooked. It should be made clear
that the concept of structure continues
to have a place in the present theory.
Matrices are structures; they are aspects
of perceptual-cognitive experience.
Unlike habit and trait, however,
they tend to be more integrated with
certain behaviors of O. Even the self-concept,
employed in the present theory
as a matrix component, is always tied
to certain behaviors of S and O.

The interpersonal environment is
not always stable and familiar. The
present approach predicts that were
the interpersonal environment to suddenly
undergo drastic change, with
others uniformly behaving toward the
individual in markedly new and strange
ways, an individual held in such an
environment would rapidly modify his
own behavior and internal structure
to produce a new set of congruent matrices.
As a result, he would be a
radically changed person. This prediction
is consistent with the experiences
of some soldiers imprisoned and
“brainwashed” by Chinese communists.
Some investigators have emphasized as
a key factor in successful brainwashing
the isolation of the individual from
his normal interpersonal relations.
Marked changes in an individual may
also occur as a result of shifts in a
single matrix, if that matrix involves
a highly valued O, e.g., a therapist.







As noted earlier, there are complexes 
consisting of matrices relevant to each 
other because they contain at least one 
common component, and a shift in one 
highly valued matrix may have rami- 
fications throughout the structure of 
relevant matrices. In a word, the difference
in emphasis between the present
and previous approaches to behavioral
stability is that the individual
strives to maintain interpersonal rela- 
tions characterized by congruent ma- 
trices, rather than to maintain a self, 
habits, or traits. Some of the advantages
of this approach are the following:

1. The approach provides a place in 
theory for the variability of individual 
behavior in different interpersonal sit- 
uations. For example, it does not 
make the questionable assumption that 
an individual belongs at fixed points 
with respect to each of a variety of 
personality traits in all interpersonal
situations. A given individual is unlikely
to be equally aggressive in interaction
with every other person or in
every situation involving the same person.
Yet, a typical personality appraisal
instrument yields a single score
on “aggressive” for an individual. In
the present view, trait behaviors such
as aggressiveness tend to appear only
when they are congruent with certain 
O-behaviors and certain components of 
self. This idea calls for an overhaul of 
our traditional methods of personality 
assessment. 

2. While self-theory emphasizes that 
self emerges out of social interaction, 
it is rather vague on details. In fact 
it implies that the only role of the be- 
havior of others in supporting the self 
is that expressed by the first type of 
congruent O-behavior, namely, behavior
which directly defines an S-component
of S in the same way as S
does. Moreover, the organism is often
assumed to be passive in this process.
For example, the fact that S may actively
seek out those who support components
of self, may evoke certain responses
on the part of the other, etc.,
is often ignored.

3. The interpersonal framework 
makes the process of individual change 
less mysterious and less difficult to 
explain. Individual behavior is always 
linked to the behavior of others, and 
the key to deliberate change lies in 
creating shifts in the perception of the 
behavior of Os toward S. This is ac- 
complished either through bringing 
about actual changes in the behavior of 
Os toward S, or by blocking various 
techniques that S uses to distort or to 
evoke certain O-behaviors which tend 
to support his current perceptions.
Socialization and psychotherapy will
shortly be discussed in order to illustrate
the application of the theory to
problems of change.

4. It is believed that variables on
the perceptual-cognitive level are more
directly assessable in view of the ability
of Ss to directly describe their self-concept,
their behavior, and the behavior
of Os. In addition, the question of
cognitive distortion can be checked by
obtaining information by direct observation
of the interaction, or by obtaining
descriptive information from O.
In contrast, such concepts as need,
habit, and trait must be inferred, and
there has been considerable controversy
over the validity of various procedures
of inference in regard to these variables.
Another advantage stems from
the fact that, in the experience of the
present investigators, matrix variables
are readily manipulable when using
conventional experimental procedures
in social psychology. For example, an
incongruency can be readily created in
the laboratory by false test protocol
reports to Ss concerning aspects of self
and behavior, or by placing S in the
laboratory with an O who has been
instructed by the experimenter to behave
in an incongruent fashion.

5. The present point of view should 
be helpful in integrating concepts of 
personality with those of social psy- 
chology. At present there is a tend- 
ency to conceptualize each of these 
disciplines in such a way as to make 
difficult, if not impossible, the simul- 
taneous application of concepts from 
both areas to consideration of any par- 
ticular behavioral phenomenon. Dy- 
adic relations will be briefly touched 
upon to indicate how a rapprochement 
between these areas might be initiated. 
Also, by examining the role of culture 
in establishing congruencies and in- 
congruencies, light may be thrown on 
variations in individual strain from cul- 
ture to culture, and on a source of 
cultural change, namely, the attempt of 
individuals to resolve such strains. 

Socialization and psychotherapy.
The process of socialization involves 
those changes occurring as a result of 
movement through the social structure. 
Psychotherapy has a less systematic 
relation to social structure, and may be 
thought of as a process of change re- 
sulting from highly significant inter- 
action between patient and therapist. 
Both socialization and psychotherapy
may be examined briefly to determine 
whether or not the process of change 
can be conceptualized in terms of the 
present interpersonal theory. 

Socialization may be viewed as a 
process in which the person moves 
through a succession of changing pat- 
terns of O-behaviors over a long pe- 
riod of time. These changing patterns 
are culturally induced, and create in- 
congruencies for S. In this process 
there is a certain amount of resistance 
to change on the part of S, arising out 
of the fact that typically the changing 
behavior of Os toward him requires 
continual reorganization of previously 
established matrices. Behaviors which
previously led readily to congruent matrices
now bring forth incongruent reactions
from O. Take, for example,
the small child who has an overpro- 
tective mother. Here self-cognitions 
are rudimentary and limited, but if 
existing components were phrased in 
adult language, they might include the 
following: “I am the baby of the 
family.” “Daily family life revolves 
around me.” “I can get most anything 
I want.” When this child moves out
of his mother’s sphere of control, how- 
ever, and plays with other children, he 
meets many responses incongruent with
these components of self. Other children
are likely to thwart his needs
rather than meet them; he is unlikely
to be the center of play activities. 
These incongruencies force change in 
self-components and behavior. To the 
extent that an overprotective mother 
limits a child’s contacts with other chil- 
dren, and reduces the extent to which 
other children are valued, components 
of self inappropriate to interaction out- 
side the family develop in the child. 
Normal development in the socialization
process involves removing incongruency
by changing components of
self and behavior so that they correspond
with the definitions made by
significant Os. Because of the difficulty
of changing S-components which
are relevant to an increasingly larger
number of congruent matrices, however,
the individual also learns to use
other ways of handling incongruency:
e.g., misperception, misinterpretation,
response evocation, changing valuations
of others, etc. These latter actions
delay or prevent changes in self
or behavior. It is probable that early
adjustment to incongruency involves
changing the self-concept and behavior,
but as the individual develops these 
other techniques, he frequently handles 
incongruency by means of them. Disturbances
of the developmental sequence
may frequently occur as a result
of overuse of modes of reducing incon- 
gruency which involve distortion of 
reality instead of change in self or 
behavior. This may be expected to 
result in the establishment of a some- 
what precarious stability of self at a 
low level of maturity. This is a social 
psychological interpretation of the 
Freudian phenomenon of fixation. 
The interpersonal theory as outlined 
also seems to point to a crucial process 
in psychotherapy. The modes of re- 
ducing incongruency spelled out in an 
earlier section suggest that the pri- 
mary problem facing a psychotherapist 
is the necessity for blocking all modes 
which permit the maintenance of the 
undesirable aspects of self and behavior,
and facilitating those which involve
change in these components.
The advantage of a systematic theory 
providing an exhaustive classification 
of the various forms of reducing incongruency
is that it allows the therapist
to anticipate various ways in which the
individual may fail to change. There
are two directions, however, in which 
the theory must be further elaborated. 
One involves spelling out the behav- 
iors required of the change agent for 
successful facilitation of modes of reso- 
lution leading to desired changes in the 
self-concept and behavior of the S. 
The other requires the elaboration of 
techniques for blocking those modes 
leading to maintenance of undesirable
aspects of self and behavior. 

Dyadic relations. So far the focus 
has been on stability and change in 
individual behavior. Emphasis has 
been placed on the point that stability 
and change in individual behavior are in
part a product of a reciprocal process 
having its locus in the interactions be- 
tween the individual and those around 
him. This reciprocal process may also 
be applied to an understanding of both 
members of the dyad. The instability
of adolescent love relations would appear
to be initiated by the confronting
of the adolescent with marked and fre- 
quent changes in the behavior of peers 
and adults which are incongruent with 
components of self and his behavior. 
These changes lead to shifts in self- 
concept and behavior, and in turn new 
relations are sought to establish con- 
gruency with these modified aspects of 
self and behavior. The disruptive ef- 
fect of separation on a friendship or a 
marriage is another case in point. The 
new interpersonal environment con- 
tains Os who define the individual’s 
self and behavior differently, and he 
gradually changes so as to establish 
congruency in this new interpersonal 
situation. On the other hand, old rela- 
tions become less congruent, and con- 
sequently progressively weaker in af- 
fect. These changes are dramatically 
illustrated by the disappointment so 
often experienced at the reunion after 
long separation of boyhood friends or 
ex-army buddies. In a similar fashion,
induced changes in personality, such as 
those which occur in psychotherapy, 
and fortuitous ones which arise as a 
result of entering a new job, obtaining 
more formal education, etc. may well 
result in drastic shifts in dyadic rela- 
tions. 


SUMMARY AND CONCLUSIONS 


Most students of individual behavior 
commonly assume that when behavioral 
stability occurs, it is a function of sta- 
bility in personality structure, and that 
in the absence of special change-induc- 
ing forces, structure remains constant. 
Thus continuity in individual behavior 
is not a problem to be solved; it is 
simply a natural outcome of the for- 
mation of stable structure. Behavioral 
change, moreover, either is not given 
systematic attention, or is explained 
independently of stability, in terms of 
environmental forces and fortuitous
circumstances. The present interpersonal
theory attempts to account for
both stability and change in terms of 
a single set of explanatory principles. 

In the present view, the locus of 
behavioral stability and change lies in 
the interpersonal matrix, which has
three components: an aspect of the
self-concept of S, S’s interpretation of
those elements of his behavior related
to that aspect, and S’s perception of
related aspects of the person with whom
he is interacting (O). S strives to
achieve congruency among the components
of the matrix, a state which is
achieved when the behaviors of S and
the other imply definitions of self which
are consistent with relevant aspects of
the self-concept. Congruency is continually
threatened by the changing nature
of interpersonal relations due to:
(a) normative patterns of change, 
which result in changes in the behavior 
of Os toward S, e.g., changes in the 
behavior of Os as the child grows older, 
or as a person changes occupations or 
gets married; (b) fortuitous changes, 
e.g., death of a parent, loss of a friend, 
induction into the military; and (c) 
the fact that O’s means of establishing 
congruency for himself often create
incongruencies for S.

When incongruencies arise, a variety 
of interpersonal processes which have 
been carefully described here often act 
to restore congruency without change 
in self or behavior. Sometimes, however,
these same interpersonal processes
operate to form a new congruent
matrix which involves a change in self
or behavior. The circumstances under
which either of these effects takes place
are a function of the type of incongruency
which occurred and of a variety
of other conditions which have
been discussed in some detail. Thus,
a single set of principles is utilized
to offer a systematic explanation of the
reasons for stability or change in individual
behavior.

The advantages of adopting such an 
interpersonal theory are spelled out, 
and a brief discussion is included to 
illustrate how it might be applied to 
such processes as socialization, psycho- 
therapy, and dyadic relations. 


REFERENCES 

ABELSON, R. P. Modes of resolution of
belief dilemmas. Paper read at Western
Psychological Association, Monterey,
California, April 1958.

ALLPORT, G. W. Personality: A psychological
interpretation. New York: Holt, 1937.

CARTWRIGHT, D., & HARARY, F. Structural
balance: A generalization of Heider’s theory.
Psychol. Rev., 1956, 63, 277-293.

CATTELL, R. B. Personality: A systematic,
theoretical, and factual study. New York:
McGraw-Hill, 1950.

COMBS, A. W., & SNYGG, D. Individual behavior:
A perceptual approach to behavior.
New York: Harper, 1959.

EYSENCK, H. J. The structure of human
personality. New York: Wiley, 1953.

FESTINGER, L. A theory of cognitive dissonance.
Evanston, Ill.: Row, Peterson, 1957.

HEIDER, F. The psychology of interpersonal
relations. New York: Wiley, 1958.

JAMES, W. Principles of psychology. New
York: Holt, 1890. 2 vols.

LEWIN, K. A dynamic theory of personality.
New York: McGraw-Hill, 1935.

MURPHY, G. Personality: A biosocial approach
to origins and structure. New
York: Harper, 1947.

MURRAY, H. A. Explorations in personality.
New York: Oxford Univer. Press, 1938.

OSGOOD, C. E., & TANNENBAUM, P. H. The
principle of congruity in the prediction of
attitude change. Psychol. Rev., 1955, 62,
42-55.

ROGERS, C. R. Client-centered therapy: Its
current practice, implications, and theory.
Boston: Houghton Mifflin, 1951.


(Received January 16, 1959) 








Psychological Review 
1961, Vol. 68, No. 1 


REALISM OF CONFIDENCE JUDGMENTS ¹


JOE K. ADAMS ²
Palo Alto Medical Research Foundation

PAULINE AUSTIN ADAMS ³
Menlo Park, California


The problem of the validity of beliefs
has occupied Western philosophers
from the time of the early Greeks
on down to the present day. The modern
form of this problem is that of
finding criteria according to which degrees
of belief can be justifiably assigned
to assertions (or their negations).
Psychologists, economists,
mathematicians, and others have also
been concerned with this question, usually
in a less general way, i.e., they
have usually been concerned with beliefs
of more or less specific kinds.
There is very limited agreement at
present either on the normative problem
of finding adequate criteria for
various degrees of belief or on the
more empirical problem of measurement
of degrees of belief. Within psychology
the normative problem has
been attacked primarily on the level of
finding adequate criteria for holding
some degree of belief in an assertion
made professionally, i.e., on the level
of methodology. The measurement
problem has been attacked in several
lines of research, two of which come
under the headings of “confidence”
and “subjective probability.” Measurement
of confidence has usually been of
an admittedly ordinal type, obtained
by direct judgments by the subject
(S), whereas measurement of subjective
probability has usually been of an
assertedly interval- or ratio-type obtained
by inference from betting decisions.
Except for very restricted
classes of events, adequate criteria
for realism of confidence or subjective
probability have been lacking. In what
follows we shall expand these remarks
and propose a method of measuring
confidence (or subjective probability)
which is both descriptive and, in a
sense, normative, in that the method
can be used to obtain a meaningful
measure of realism in judgment, for
any of a wide range of classes of events.
We shall also describe several areas of
application of this method.


¹ This research has been supported by
Public Health Service Research Grant M-2297
and by Public Health Service Fellowship
MF-6977. Grateful acknowledgment is
made to E. R. Hilgard for several very
helpful comments on a first draft of this
manuscript.

² Formerly Research Associate, Applied
Mathematics and Statistics Laboratory,
Stanford University.

³ Formerly Research Associate, Department
of Psychology, Stanford University,
and Research Fellow, USPHS.


APPROACHES TO THE PROBLEMS 
INVOLVED IN CONFIDENCE OR 
SUBJECTIVE PROBABILITY 


Epistemological discussions have traditionally
been concerned with the general
question of whether any of our
beliefs can be justified;⁴ recently,
however, a much greater emphasis has 
been put on the problem of the degree 
of justification that can be adduced for 
a given assertion. As with any philosophical
problem, wording rapidly becomes
critical, and even to state the
problem is to invite both confusion and
a challenge. At least from one point
of view, however, the ill-defined field
of epistemology can be said to be concerned
with finding some set of principles
which enables us to assign, in
some justifiable way, a degree of confidence
to any assertion contained
within some class of assertions with
which we are concerned.


⁴ Three classical examples of philosophical
questioning are the discussions concerning
belief in the existence of external objects,
belief in the existence of another person’s
experience, and belief in the validity of one’s
own memory. As has been noted by many
others, to express skepticism about these
beliefs in any way other than intellectual
discussion can easily result in behavior that
is odd, to say the least.


There have been noteworthy at- 
tempts to develop principles of great 
generality, which hold over a large 
range of assertions, or even over all. 
As in many analogous situations, how- 
ever, there is an inverse relationship 
between generality and the extent to 
which the principles can actually be 
used. Carnap’s system (1950), for 
example, while of extraordinary theo- 
retical generality, cannot be used within 
the foreseeable future except with the 
relatively very limited classes of as- 
sertions covered by much less general 
systems. According to Georgescu- 
Roegen (1958), Carnap’s work has 
made clear the impressive difficulties 
facing any system of comparable gen- 
erality. 

With more limited classes of asser- 
tions, various specialists have attempted 
to develop methods of evaluation which 
enable one to attach a justifiable de- 
gree of confidence, at least approxi- 
mately, to each assertion within their 
particular field. In psychology, for 
example, attempts have been made to 
develop methods for evaluating asser- 
tions such as: 


For a single response, the greater the delay 
in reinforcement, the weaker the reaction 
potential, according to a gradient that falls 
off rapidly at first and then more slowly, 
approaching the asymptote at about 5 sec- 
onds (Hull, 1943, p. 243). Repression de- 
mands a constant expenditure of energy, and 
if this were discontinued the success of the 
repression would be jeopardized, so that a 
fresh act of repression would be necessary 
(Freud, 1925, p. 89). 


These examples are chosen to point up 
the incomplete systematization and the 
disagreement among psychologists as 
to these methods—without any in- 
tended implication that this state of 
affairs is necessarily detrimental to the 
development of psychology. Nevertheless,
we do have some systematization
and agreement, which sometimes
become apparent when we talk with 
specialists from other fields, who of 
course have their own more-or-less- 
developed-and-agreed-upon systems. 

There exist at least two large classes 
of assertions for which there are well 
systematized and agreed upon methods 
for arriving at convictions. One of 
these is the class of mathematical prop- 
ositions for which alleged proofs have 
been published—though even here both 
systematization and agreement are not 
entirely complete. For unproved math- 
ematical propositions, however, the sit- 
uation is quite different; for example, 
there is no known way of saying how 
confident one should be that Gold- 
bach’s conjecture (that for every even 
number greater than 2, two primes 
exist such that their sum is that num- 
ber) is correct, despite the existence 
of some partial results and despite the 
fact that many mathematicians strongly 
believe this conjecture—perhaps in 
spite of themselves. There is a dichot- 
omous aspect of justifiable belief in 
a mathematical proposition: either it 
has been acceptably proved and one 
can believe it with certainty, or one 
cannot assign any justifiable degree of 
belief—for example, there is no gen- 
erally acceptable way of justifying or 
discrediting the statement that Gold- 
bach’s conjecture should neither be 
believed nor disbelieved, or that the 
number 0.5 should be assigned as the 
probability that it is correct. 
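
The asymmetry described here can be made concrete. The minimal Python sketch below (the function names is_prime and goldbach_pair are hypothetical, chosen for this illustration) finds, for a given even number, two primes that sum to it. Finding such a pair for every even number up to some bound confirms the conjecture for those cases only; it supplies no agreed-upon rule for converting that evidence into a degree of belief in the general proposition.

```python
def is_prime(n: int) -> bool:
    """Trial division; adequate for the small numbers used here."""
    if n < 2:
        return False
    i = 2
    while i * i <= n:
        if n % i == 0:
            return False
        i += 1
    return True


def goldbach_pair(n: int):
    """Return (p, q) with p the smallest prime such that p + q == n and q is prime.

    Returns None if no such pair exists, which would be a counterexample
    to the conjecture.
    """
    if n <= 2 or n % 2 != 0:
        raise ValueError("n must be an even number greater than 2")
    for p in range(2, n // 2 + 1):
        if is_prime(p) and is_prime(n - p):
            return p, n - p
    return None


print(goldbach_pair(28))  # (5, 23)

# Check every even number from 4 through 998; no counterexample appears.
print(all(goldbach_pair(n) is not None for n in range(4, 1000, 2)))  # True
```

However far the bound is pushed, the output remains a finite list of confirmed cases, which is exactly the situation the text describes: strong conviction among mathematicians, but no justified numerical degree of belief.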

The other large class of assertions 
for which there exist well systematized 
and agreed upon methods for arriving 








REALISM OF CONFIDENCE 


at convictions is the class of proposi- 
tions amenable to statistical methods. 
Unlike mathematical propositions, to 
these statistically amenable proposi- 
tions we can attach degrees of convic- 
tion which are neither 0 nor 1. It is 
important to recognize the very limited 
range of these propositions, however. 
Such a proposition as that the differ- 
ence in means between Condition A 
and Condition B, in all their concrete 
totality, did not “just happen” through 
random processes should not be con- 
fused with the proposition that a par- 
ticular aspect (or combination of as- 
pects) of Condition A (which is, after 
all, what we as psychologists are ordi- 
narily interested in) results in a higher 
mean than a particular aspect of Con- 
dition B. For the identification of 
variables which are responsible for 
whatever we observe, we must rely on 
our much less systematized and agreed 
upon methods of observation and judgments
based upon knowledge, experience,
and theoretical convictions.

In our discussion we have long since
passed the point of agreement among
experts in these matters. There are
almost as many views as to the relation 
of formal theory—as in logic, mathe- 
matics, and statistical methods—to be- 
lief or conviction as there are out- 
standing theorists. To relate formal 
proof to certainty, as in the foregoing, 
is unacceptable to some mathematicians 
and logicians; to identify the numbers 
(p values) which we obtain in statisti- 
cal computations with a degree of con- 
viction is even more controversial 
among those concerned with the foun- 
dations of probability theory and its 
applications. Among psychologists and 
other empirical scientists, however, 
there is, we believe, a strong relationship
between p values and convictions
that specific experimental results cannot
be accounted for by random processes
alone, granted that some regard
the .05 level as quite satisfactory under
conditions in which others demand that 
p be no greater than .01. For a dis- 
cussion of the thorny problems of sub- 
jective interpretations of statistics the 
reader is referred to Savage (1954) 
and Georgescu-Roegen (1958); both 
authors give numerous references to 
works on the foundations of probability. 

Quite aside from the normative prob- 
lem of methods for assigning various 
degrees of conviction to assertions, 
even the measurement of the degree
of conviction with which a given asser- 
tion is made, justifiably or not, by a 
given individual, is surrounded with 
difficulties, at least in the hands of 
some authors. We are referring espe- 
cially to those assertions having to do 
with future events, such as the results 
of random processes (as in gambling) 
or events such as the increase or de- 
crease in the market price of stocks or 
commodities, etc. Economists especially
have been concerned with these
events, and they recognized long ago 
that the behavior of an individual in 
a market situation is dependent, not 
upon objective probabilities (even as- 
suming these exist, contrary to the 
strongly held convictions of some ex- 
perts) and monetary values, but upon 
expectations (including subjective prob- 
abilities) and utilities. The economists 
took overt behavior in vivo as their 
prediction goal, and were apparently 
uninterested in the possible use of 
direct verbal estimates of probabilities 
and in the development of rating scales. 
The dictum that verbalizations of ex- 
pectations usually have little to do with 
behavior seems to have been rather 
well accepted by economists and others 
working in the field of utility theory 
and subjective probability. Betting
behavior, on the other hand, was accepted
as indicating something about subjective
probabilities, though it was generally
agreed that, as subjective probability
and utility work jointly, one could
not be measured without assumptions 
about the other (Davidson & Suppes, 
1957). von Neumann and Morgenstern
(1947), for example, assumed that
subjective probability is equal to
objective probability; quite aside from
the tenuous nature of this assumption
it is not very helpful, because in most
practical situations objective
probabilities cannot be clearly defined,
much less computed. Recently, however,
Davidson and Suppes (1957) devised
a method of measuring both utility and
subjective probability, in a betting
situation.

Psychologists have been more sanguine
about the use of verbalizations of
convictions, and have published many
studies which include ratings indicating
the degree of confidence with which
various decisions have been made. That
these ratings have some validity is shown
by the frequent finding of a positive
monotonic relationship between confidence
(indicated on a verbal or numerical
ordinal scale) and some measure of
adequacy of performance. This
relationship implies some validity in the
confidence rating not only as a measure
of conviction, but also as an index of
the validity of the conviction. Possibly
the earliest result of this kind is that
reported by Peirce and Jastrow in 1884.
The usual finding has been, however, that
whereas within a given individual’s
judgments there is a high relationship
between confidence and performance, the
relationship between average confidence
and performance across individuals
shrinks to approximately zero. For a
review of the literature on confidence
judgments the reader is referred to
Johnson (1955).

A limitation of the usual ordinal
scale is that whereas with it one can
demonstrate a monotonic relationship
between confidence and performance,
one cannot conclude whether the degree
of confidence is realistic in the sense
of being appropriate to the performance
or not, either in general or at any
specific point on the confidence
scale. There are two exceptions to this
statement. If the lowest point on the 
confidence scale indicates that the indi- 
vidual expects his performance at that 
degree of confidence to be no better 
than chance, or if the highest point in- 
dicates that he expects his performance 
at that degree of confidence to be per- 
fect, it is then possible to assess the 
appropriateness at one or both of these 
points. The finding by Peirce and 
Jastrow of a discrepancy between ex- 
pectation and performance at the point 
of pure guess has been replicated many 
times since (Adams, 1957a), though 
the interpretation of this phenomenon 
is controversial (Goldiamond, 1958). 
It seems plausible that the failure to
assess realism of confidence accounts
for the relative neglect of confidence as
an important variable in behavior,
despite the reports of studies involving
this variable appearing sporadically
since 1884. It may be that realism of
confidence is in many situations a more
important variable than level of
performance itself. To take one example,
it might be more important, in terms
of his future work, social interactions,
confidence in himself, etc., for a student
to be able to discriminate realistically
between what he knows and what he
does not know than it would be for
him to know considerably more than
he does know without such
discrimination.⁵ Realism of confidence
seems obviously of importance, not only
for scholars but in general for
individuals making decisions having
consequences of importance, either for
themselves or for others. To be realistic
in the sense of being able to assign
probabilities of outcomes accurately does
not, of course, imply being “realistic” in
the sense of never attempting to
accomplish anything which seems highly
improbable. The word “cautious” is more
accurate for the latter meaning.

⁵ On the basis of some informal
observations a hypothesis which seems
worth testing is that realism of
confidence is a variable which would
discriminate between those who have
simply “picked up” their general
knowledge from conversations and mass
communication media and those who have
learned through formal education or from
authoritative sources.



PROPOSED METHOD OF MEASURING
CONFIDENCE OR SUBJECTIVE PROBABILITY
AND REALISM OF CONFIDENCE


The method which we have used is, 
in a sense, absurdly simple. Like many 
other psychologists we have done ex- 
actly what some writers in the field of 
subjective probability have asserted to 
be useless, i.e., we have used direct 
verbal estimates instead of attempting 
to infer confidence or subjective prob- 
ability from other behavior. Those 
who have shunned this method, while 
undoubtedly correct with respect to its 
crude usage in betting situations, have 
apparently overlooked the frequently 
demonstrated ordinal relationship
between confidence ratings and
performance, over a wide range of tasks.⁶
We took this relationship as our starting
point and simply defined the confidence
scale to S in terms of expected
percentages (Adams, 1957b), so that S
understands that of all those decisions
made with confidence p, about p%
should be correct.⁷ He understands
that if he makes a large number of
decisions at each of several points on
the confidence scale, then, if he is being
realistic in his confidence, the
experimental points when plotted should
fall near the 45-degree line shown in
Figure 1. Definition of the confidence
scale in this way has at least two
advantages over verbal or numerical
ordinal scales: First, performance can be
compared with expectation at all points
along the scale. Second, individual
differences in the meanings of various
points on the scale are reduced.

⁶ Recent examples of this relationship
are provided by L. Decker and I. Pollack
(1958) and by E. Carterette and M. Cole
(1959). Several other researchers are
working along the same lines.

⁷ The term “expected percentages” was
used because it accurately expresses the
intended meaning of the instructions to
S. The term “subjective probability” has
been used with several different meanings
and is not as appropriate.

[Figure 1: PERCENTAGE CORRECT plotted
against CONFIDENCE, with the 45-degree
line.]

Fig. 1. A realism of confidence function,
showing the percentage correct, P, as a
function of confidence, p. (The 45-degree
line shows the ideal, i.e., P = p.)

If we say that ideally each decision
made with confidence p has independently
the probability p of being correct,
then the sample percentage correct, P,
should be distributed according to the
terms of the binomial (p + q)^n, in
which n is the number of decisions
made with confidence p and q = 1 - p.⁸
The expected value of P would therefore
be p. Actually P may be much
larger or smaller than p; the points
plotted in Figure 1 could, if based on
sufficiently large sizes of n, show
considerable underconfidence at the lower
end of the confidence scale (p < P)
and considerable overconfidence at the
upper end (p > P). We refer to the
extent to which the points P_i
approximate the points p_i as realism of
confidence. Several different realism of
confidence scores can be defined; the
entire set of deviations p_i - P_i, as a
function of p_i, is called the realism of
confidence function.

⁸ For theoretical reasons alone we would
expect the binomial to give the
distribution of P only approximately.
First, although it is possible, with some
experimental arrangements, to insure that
the correctness of the kth decision is
independent of the correctness of the
(k - 1)th, it is virtually impossible to
prevent some positive or negative
autocorrelation of the confidence
judgments themselves. Any nonzero
autocorrelation will tend to flatten the
distribution of P. A second factor
working in the same direction is that
the confidence judgments at a point p_i
actually represent a range of confidence
rather than a single degree.

Probably the earliest use of a
confidence scale similar to ours was
reported by Swets, Tanner, and Birdsall
in 1955 in the detection of visual
signals. These authors, although
referring to the points on their scale as
a posteriori probabilities, admittedly
treat their data as though the scale were
ordinal only. We would go a step
further and assert that their data show
their observers to be for the most part
underconfident in their judgments.
The scale under discussion differs from
an absolute scale such as weight (in
pounds) in that, although S may not
have learned to fit his doubts to the
scale, the objective meaning of the
points on the scale is independent of
information not immediately available
to S, such as the mass of an object kept
in the International Bureau of Weights
and Measures, or the subjective
heaviness of a familiar object such as a
pound of margarine. This is not to
deny that in some instances making
concepts of percentages more concrete
by training Ss in the judgment of
percentages themselves could conceivably
affect subsequent confidence judgments.
To the extent that the latter is the case,
however, it would also be true that
training in judging percentages would
modify attitudes toward p values as
used in statistical analysis of data.

Let us restate just what the proposed
method involves. Ss are asked
to attempt to assign a number p to
each decision in such a way that the
percentage P of correct decisions within
a given class (i.e., with a given value
of p) will behave like a binomial
variate with mathematical expectation p.
When we say “behave like a binomial
variate” we mean simply that it will
be found useful to use the binomial
distribution in the analysis of the
results which are obtained, and that the
set of differences p_i - P_i can be used
as a measure of the extent to which S
achieved what he was asked to do.⁹
Stated in this way, it is clear that this
method does not assume that there
exists an objectively definable
probability that S should ideally give as
his confidence judgment for a given
decision, i.e., some probability that can
be defined independently of his
confidence judgment.¹⁰ For example, if S
assigns the number 70 to his decision
that the statement, “Brussels is the
largest city in Belgium,” is true, we are
not assuming that, given this S’s state of
knowledge, there is some objectively
definable probability that he should give
as his confidence judgment; nor are we
assuming that there is some objectively
definable probability that Brussels is
in fact the largest city in Belgium.
Instead, we use the best available
information as a criterion for
determining the correctness of S’s
decision, and thus, using the additional
decisions made with the same confidence
p, we are able to compute P for
comparison with p. S is instructed that
he is not to be concerned with whether
the criterion itself (for example, data
from the latest census) is invalid. If S
does not accept the criteria as a basis
for the computation of P, he should not
accept the results as a measure of
realism of confidence. To do so would
indicate misunderstanding of the
method. Analogous statements can be
made for S’s decision that a given stock
will close at above a given price on a
certain future date, etc.

⁹ See Footnote 8.

¹⁰ In special cases an objectively
definable probability does exist, for
example, in guessing experiments in which
events are programed by tables of random
numbers, or by mechanical random devices.
In these cases each confidence judgment
can be compared with the independently
defined probability.
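
The restatement above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' own procedure: the function name `realism_function`, the input layout (pairs of a confidence value in percent and a flag for whether the decision met the criterion), and the sample data are all hypothetical.

```python
from collections import defaultdict

def realism_function(decisions):
    """Tabulate a realism of confidence function.

    decisions: iterable of (p, correct) pairs, where p is the stated
    confidence in percent and correct is True when the decision agrees
    with the criterion. Returns {p: (n, P, p - P)}, where n is the
    number of decisions made with confidence p and P is the percentage
    of those decisions that were correct.
    """
    tally = defaultdict(lambda: [0, 0])  # p -> [decisions, correct decisions]
    for p, correct in decisions:
        tally[p][0] += 1
        tally[p][1] += int(correct)
    result = {}
    for p in sorted(tally):
        n, k = tally[p]
        P = 100.0 * k / n
        result[p] = (n, P, p - P)  # positive deviation = overconfidence
    return result

# Hypothetical data: eight decisions at three confidence levels.
sample = [(50, True), (50, False),
          (70, True), (70, True), (70, False), (70, True),
          (100, True), (100, True)]

for p, (n, P, deviation) in realism_function(sample).items():
    print(p, n, round(P, 1), round(deviation, 1))
```

With realistic confidence the deviations p - P hover near zero at every level; in practice far more than eight decisions would be needed before the deviations mean much.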

The method we are proposing has 
three important limitations: (a) a suit- 
able criterion for the correctness of 
each decision must be available, either 
at the time of decision or later; (b) no 
single confidence judgment itself can 
be said to be realistic or not (with the 
exception of certainty paired with an 
incorrect decision, or zero confidence 
paired with a correct decision) ; and 
(c) large numbers of decisions and 
confidence judgments are necessary for 
reasonably precise determination of 
realism of confidence functions. 
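
Limitation (c) can be made concrete with the ideal binomial model described earlier. The sketch below computes the standard error of P, in percentage points, for a given confidence p and number of decisions n. The function name is hypothetical, and because the text notes that the binomial holds only approximately, the figures should be read as rough lower bounds on the sampling variability rather than exact values.

```python
import math

def binomial_se_percent(p, n):
    """Standard error of the percentage correct P, in percentage points,
    if each of n decisions made with confidence p (in percent) were
    independently correct with probability p/100."""
    prob = p / 100.0
    return 100.0 * math.sqrt(prob * (1.0 - prob) / n)

# Sampling variability of P at confidence 70 for several values of n.
for n in (10, 25, 100, 400):
    print(n, round(binomial_se_percent(70, n), 2))
```

Under this idealization the standard error halves only when n is quadrupled, which is why a single point on the function needs on the order of a hundred decisions before a deviation of a few percentage points can be distinguished from chance.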

Even with the limitations mentioned,
a realism of confidence function can be
obtained under a wide variety of
conditions. For example, an S can give a
confidence judgment in conjunction
with a psychophysical decision,¹¹ a
true-false decision, a perception, a
motor performance, a recall, a
performance in a social situation, a
prediction, etc. Further, he can give the
confidence judgment either before or
after his decision, or both before and
after. The instructions may or may not
include the word “confidence”; the S
may instead be asked how likely or
probable it is that his decision is
correct. The scale extends from 100/K
to 100, with K being the number of
categories among which S is to
decide.¹² When K is very large, as in
tachistoscopic word recognition, the
scale extends from 0 to 100.¹³ The
interpretation of realism of confidence
scores depends upon the conditions
under which the scores are obtained;
the quantitative comparability of the
scores under differing conditions,
however, offers unusual opportunities for
making comparisons and studying
interrelationships among psychological
processes.

¹¹ To avoid confusion we shall refer to
any operation about which a confidence
judgment is made as a “decision.”
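
Two points in this paragraph lend themselves to a small worked example: the lower end of the scale, 100/K, and the caution in Footnote 12 that unequal numbers of events per category can distort results at the guessing point. The sketch below is an illustration only; the function names and the parameters `bias_true` and `frac_true` are hypothetical and do not come from the article.

```python
def chance_level(k):
    """Lowest point on the confidence scale, 100/K, when S chooses
    among K categories that occur equally often."""
    return 100.0 / k

def guess_percent_correct(bias_true, frac_true):
    """Expected percentage correct for pure guesses on true-false items.

    bias_true: probability that S answers "true" when guessing
               (hypothetical parameter).
    frac_true: proportion of the statements that are in fact true
               (hypothetical parameter).
    """
    hit_true = bias_true * frac_true               # says "true", item is true
    hit_false = (1 - bias_true) * (1 - frac_true)  # says "false", item is false
    return 100.0 * (hit_true + hit_false)

print(chance_level(2), round(chance_level(3), 1))
print(round(guess_percent_correct(0.7, 0.5), 1))  # balanced items
print(round(guess_percent_correct(0.7, 0.6), 1))  # mostly true items
print(round(guess_percent_correct(0.7, 0.4), 1))  # mostly false items
```

With balanced items a response bias leaves the guessing level at 50, but with a bias toward "true" and predominantly true statements the expected P at confidence 50 rises above 50 (apparent underconfidence), and with predominantly false statements it falls below 50 (apparent overconfidence).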


APPLICATIONS OF THE METHOD 


We can illustrate the realism of
confidence function with examples from
two experimental procedures which
differ greatly from each other. Figure
2 shows realism of confidence functions
which were obtained from the sorting of
100 true-false statements on each of five
consecutive days, a confidence judgment
(on a scale from 50 to 100) being
obtained with each decision. A different
set of statements was used each
day; they were of a general nature,
being made up from the World Almanac
and other general information sources
and including items from the fields of
history, geography, literature, art,
sports, and entertainment. The left side
of Figure 2 shows the results from S₁
(a psychologist); the right side shows
the results from S₂ (a hospitalized
schizophrenic). We hasten to add that we
are not asserting that the differences
between these sets of functions are
typical of differences between
psychologists and hospitalized
schizophrenics. S₁’s confidence
judgments can be seen to be highly
realistic; S₂ can be seen to be highly
overconfident, especially at confidence
judgments of 80 and 90. This
overconfidence is especially striking in
view of the fact that S₂ was informed of
his overconfidence at the end of each
day’s session and at the beginning of the
next (S₁ was given no information).
S₂ acknowledged that his performance
demonstrated overconfidence, and made
a strong effort to correct it, but his
only improvement lay in achieving, on
the fifth day, 100% correct at
confidence of 100. S₁’s percentage
correct increases appropriately as
confidence increases whereas S₂’s
percentage correct in the range 50-90
has a flat appearance, indicating either
that S₂ was not differentiating his
feelings of doubt and assigning numbers
accordingly or that his differential
doubts within this range had no validity,
at least for these general information
items. S₂’s dogmatic manner in making
assertions was in harmony with
his experimental results. He did not,
however, question the criteria. We
feel justified in labeling this patient
“schizophrenic” on the basis of a number
of sound criteria, one being his
assertion that at times he was Jesus
Christ.

¹² Difficulty of interpretation may arise
if the actual events are not chosen by
the experimenter in equal numbers from
each of the K categories. For example, a
response bias toward “true” when S is
“just guessing” could result in a
spurious underconfidence at p = 50 if the
statements were in fact predominantly
true, or a spurious overconfidence
if the statements were predominantly
false.

¹³ It is possible to use the 0 to 100
scale in all cases if S is asked to state
his confidence that each event falls in a
particular category, for example, that
each statement is true. We are just
beginning to explore results with this
method, for comparison with the method
previously described.

[Figure 2: two panels, each plotted
against CONFIDENCE (50 to 100), with
separate curves for the FIRST, SECOND,
THIRD, FOURTH, and FIFTH DAY.]

Fig. 2. Realism of confidence functions
for 100 true-false decisions about
general information statements on each of
five consecutive days, for a psychologist
and a schizophrenic patient. (Five
hundred different statements were used.)

[Figure 3: two panels, INTENTIONAL
LEARNERS and INCIDENTAL LEARNERS, each
plotted against CONFIDENCE (0 to 100),
with separate curves for 1, 2, 4, 8, and
16 TRIALS.]

Fig. 3. Realism of confidence functions
for recall of nonsense syllables for
intentional and incidental learning
groups, with 1, 2, 4, 8, or 16 trials.

Figure 3 shows mean realism of
confidence functions for university
undergraduates who were asked to
indicate their confidence in their
recalls of nonsense syllables after 1, 2,
4, 8, or 16 trials on a list of 30
syllables (20 Ss in each group). The
curves on the left are for intentional
learners; those on the right are for
incidental learners. Both groups show
marked overconfidence,¹⁴ but the
incidental learners show a greater
overconfidence than the intentional
learners. Further, the intentional
learners give more realistic confidence
judgments the more practice they have had
on the syllables, whereas the incidental
learners do not improve in their realism
of confidence. These results indicate
that at least in some situations, along
with the usual learning process, i.e., an
improvement in performance with practice,
there occurs an improvement in the
ability to assess one’s performance in an
appropriate or realistic manner.

¹⁴ The overall measure of overconfidence
for each S is the algebraic discrepancy
score, defined as

$$\frac{\sum_i n_i (p_i - P_i)}{\sum_i n_i}$$

in which $P_i$ is the percentage correct
at confidence $p_i$ and $n_i$ is the
number of decisions made with confidence
$p_i$. This score reduces simply to the
mean confidence minus the total percent
correct.
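
The algebraic discrepancy score of Footnote 14 can be checked numerically. The sketch below takes the score to be the n-weighted mean of p_i - P_i, an assumption chosen because it agrees with the footnote's remark that the score reduces to mean confidence minus total percent correct. The function name and the three data points are hypothetical.

```python
def algebraic_discrepancy(points):
    """points: list of (p_i, P_i, n_i) triples, with p_i the confidence,
    P_i the percentage correct at that confidence, and n_i the number of
    decisions made with that confidence.
    Returns sum(n_i * (p_i - P_i)) / sum(n_i); positive values indicate
    overconfidence, negative values underconfidence."""
    total_n = sum(n for _, _, n in points)
    return sum(n * (p - P) for p, P, n in points) / total_n

# Hypothetical realism of confidence function for one S (100 decisions).
points = [(50, 40.0, 20), (70, 60.0, 50), (90, 70.0, 30)]

score = algebraic_discrepancy(points)

# The same quantity computed as mean confidence minus total percent correct.
total_n = sum(n for _, _, n in points)
mean_confidence = sum(n * p for p, _, n in points) / total_n
total_percent_correct = sum(n * P for _, P, n in points) / total_n

print(score, mean_confidence - total_percent_correct)
```

Because over- and underconfidence at different points can cancel, a score near zero does not by itself show that every point lies near the 45-degree line.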

In general, applications of the
method fall into three areas:

1. Generality of realism of confidence,
within or between individuals

2. Training in realism of confidence
and transfer to different classes of
conditions

3. Relations between realism of
confidence and other variables, either
within or between individuals

In Problem Area 1 we have merely
scratched the surface, and shall confine
our remarks to a description of this
area and what we believe to be its
significance. Ordinarily the
possibilities of studying the generality
of any psychological process within the
individual are severely limited by
changes in the unit of measurement from
one set of conditions to another. For
example, it is very difficult to compare
learning of nonsense syllables with
learning of history within the
individual, because no comparable units
of measurement can be defined. The
same can be said for comparing
perception of words with perception of
colors, etc. Comparisons between
psychological processes of this kind are
ordinarily possible only in terms of
individual differences. Clusters within 
individuals, however, can never be de- 
termined by the study of individual 
differences, if different individuals have 
different clusters. Because realism of 
confidence is measured in the same 
units regardless of the kind of decision 
to which it pertains, it is possible 
to compare realism of confidence for
decisions involving different sense
modalities, different psychological
processes (for example, perception,
memory, reasoning), and different
motivational variables such as stress,
risk, and
anxiety. Presumably “clusters” of 
realism of confidence functions could be 
found within the individual which 
would be interpretable within his over- 
all behavior. So far we have been 
unable to formulate meaningful hy- 
potheses about clusters of realism of 
confidence functions within individuals, 
nor have we yet determined any such 
clusters experimentally. Of interest 
also is the question of generality in 
terms of individual differences, which 
we have demonstrated among several 
tasks but have not yet succeeded in 
organizing in any meaningful way. 
Most of our research has been in 
the second area, i.e., in training Ss to 
make more realistic confidence judg- 
ments and in the transfer of this train- 
ing to confidence judgments about radi- 
cally different decisions. In one ex- 
periment, reported in detail elsewhere 
(Adams & Adams, 1958), we pre- 
sented S with a sequence of 156 pairs 
of words on each of 5 days, using a 
different set of pairs each day. S de- 
cided whether the second word was 
synonymous with, antonymous to, or 
unrelated in meaning to the first, then 
gave his confidence that his decision 
was correct. The confidence scale ran 
33, 40, 50, . . . , 100. At the end of 
each of the first four sessions each S$ 
in the experimental group was shown 
the realism of confidence function that 
he had produced by his previous 156 
decisions and confidence judgments. 
The control Ss were shown only their 
distributions of confidence judgments. 
The experimental group of 14 Ss
decreased their mean absolute
discrepancy score¹⁵ from 13.20 to 6.92
(p < .01), whereas the control group
increased from 11.16 to 15.22 (not
significant). This result is interesting
when we consider the question of what
cue the Ss learned to respond to. It
seems plausible that this cue was an
internal one, an event aroused by the
external stimulus, the pair of words.
Processes of differentiation and
discrimination of these subtle events
seemed to be occurring.¹⁶ We
conceptualize these processes as of three
kinds: first, a differentiation of inner
events so that their occurrences
distribute themselves more readily into a
greater number of discrete categories;
second, a greater and more discriminating
awareness of these events (which are
represented in awareness as feelings or
attitudes of doubt); and third, a more
appropriate fitting of the phenomenally
available events to the numbers (expected
percentages) on the confidence scale. As
the words were of different frequencies,
however, and as the percentage correct
is related to frequency, it was
conceivable that the Ss were learning to
differentiate and discriminate
familiarity only, i.e., to match
familiarity with confidence judgments.

¹⁵ The mean absolute discrepancy score
is defined for each S as

$$\frac{\sum_i |p_i - P_i| \sqrt{n_i}}{\sum_i \sqrt{n_i}}$$

with meanings as given in Footnote 14.
The weight $\sqrt{n_i}$ was used because
of its relation to reliability; however,
substituting $n_i$ for $\sqrt{n_i}$ did
not affect the conclusions.

¹⁶ Certain alternative explanations, such
as that the Ss decreased their
discrepancy scores simply by raising or
lowering all their confidence judgments,
are discussed and rejected by Adams and
Adams (1958). The extent to which
increased motivation accounts for these
results, as well as those of the transfer
experiment which followed it, is
indeterminate. Our statements about the
processes involved in improvement leave
open the question of the relative roles
of motivation and of practice and
feedback per se.

To determine whether a process of
differentiation and discrimination of
feelings of doubt might occur, a
transfer experiment was performed, with
the Ss (Stanford undergraduates) being
trained on decisions different in
kind from those on which they were
finally tested.¹⁷ On the first day S
made 108 decisions about the percentage
of blue dots in a set of blue and
red dots and the percentage of total
length a blue line was of a blue and
red line. On the second and fourth
days S decided on the truth or falsity
of 250 general statements. On the
third day S lifted weights while
blindfolded and decided upon the ordinal
position of the weight in the series.
Confidence judgments were given with
all decisions, with the scale varying
from 10-100 for the percentage
decisions to 50-100 for the true-false
decisions. Each of the eight experimental
Ss was shown his realism of confidence
function at various points in
the training; the eight control Ss were
shown only their distributions of
judgments, as before. On the fifth day
each S made 256 decisions as to the
synonymity, antonymity, or unrelatedness
of word pairs, as in the previous
experiment, giving confidence judgments
ranging from 33 to 100. All Ss
were run individually. The mean absolute
discrepancy score for the experimental
group was 9.28, whereas
that for the control group was 13.61
(t = 2.79; p < .02). Comparing these
results with those obtained in the
previous experiment, we conclude that
some transfer of training did occur,
though as we would expect it was not
100% transfer.
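
The mean absolute discrepancy score used in both training experiments (Footnote 15) can be sketched as follows. The weighting by the square root of n_i follows the footnote; the function name, the optional `weight` argument, and the sample points are hypothetical and serve only as an illustration.

```python
import math

def mean_absolute_discrepancy(points, weight=math.sqrt):
    """points: list of (p_i, P_i, n_i) triples.
    Returns sum(|p_i - P_i| * w(n_i)) / sum(w(n_i)), with w(n) = sqrt(n)
    by default, as described in Footnote 15."""
    denominator = sum(weight(n) for _, _, n in points)
    numerator = sum(abs(p - P) * weight(n) for p, P, n in points)
    return numerator / denominator

# Hypothetical realism of confidence function for one S.
points = [(50, 60.0, 16), (70, 70.0, 64), (90, 75.0, 36)]

with_sqrt_weights = mean_absolute_discrepancy(points)
with_n_weights = mean_absolute_discrepancy(points, weight=lambda n: n)

print(round(with_sqrt_weights, 2), round(with_n_weights, 2))
```

Unlike the algebraic score, this measure does not let underconfidence at one point cancel overconfidence at another, so it falls toward zero only as every point approaches the 45-degree line.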

We believe these results are of
theoretical interest for several reasons.
First, the explanatory mechanism that
seems most plausible to us is some
kind of differentiation and
discrimination of internal cues
(“feelings of doubt”) aroused by the
demands of an external situation. Second,
the particular content of this learning
process may have far reaching
implications. In a sense, our Ss became
more realistic in their expectations
about the external world, as well as
about their own performances. This method
of training may provide a way of
improving the ability to discriminate
knowledge from faith and opinion. There
may be important applications of this
kind of training, for example, in formal
education, although our one attempt to
use this method in a classroom situation
failed to show positive results; an
attempt to train Ss in groups in a
laboratory experiment was also
unsuccessful. Of course it would also be
possible to train Ss to be unrealistic,
i.e., to be overconfident or
underconfident. Training of the latter
kind would be reminiscent of
level-of-aspiration experimentation
(Lewin, Dembo, Festinger, & Sears, 1944),
in which apparently no S has been
deliberately trained to be more
realistic.

¹⁷ David Reynolds ran the Ss in these
experiments.

Although this method of training 
does not solve the classical epistemo- 
logical problem of finding objective 
criteria for beliefs, it may be useful as 
a basis for the development of the 
ability to weigh evidence soundly. It 
can be argued that realism of con- 
fidence over a specified class of asser- 
tions with known truth values is a 
necessary, though not a sufficient, con- 
dition for realism over a related class 
of assertions with unknown truth val- 
ues. In other words, there may be 
little reason to attach much weight to 
an individual’s strong convictions about 
a given assertion with unknown truth 
value if there is demonstrably little 
reason to attach much weight to that 
same individual’s strong convictions 
about a related class of assertions with 
known truth values. By “related” we
mean to imply some kind of independently
determined comparability in the
decision and confidence judgment
processes, from one class of assertions
to another.

The third area, that of relations
between realism of confidence and other
variables, is also one which we have
explored only to a very limited extent.
Stanford students taking an objective
final examination in a one-quarter
course in elementary psychology
indicated their confidence in each
answer. Fifty-six of these Ss had been
studied intensively by Richard Alpert and
Ralph Haber¹⁸ during the previous
months as to their achievement
motivation and achievement anxiety, and
several measures of achievement
motivation and anxiety and fear of
failure on the examination were taken
immediately before and after the
examination. Correlations of about .36
(p < .01) were obtained between absolute
discrepancy scores and fear of failure.
Overconfidence or underconfidence, as
measured by the algebraic discrepancy
score, correlated approximately .06
with these measures. In other words,
it was not overconfidence or
underconfidence per se, but an
inappropriate assessment of one’s
knowledge which was related to fear of
failure on this examination. That this
relationship was not simply a function of
knowledge per se is shown by the
correlations of approximately .16 between
total percentage correct and fear of
failure.

Each of these areas of research
which we have briefly described is vast.
Unfortunately, large numbers of
decisions and confidence judgments are
necessary to obtain reliable measures
of realism of confidence. For example,
although in principle the application of
this method to the study of paranoid
systems (or other cognitive systems)
is fairly obvious, the preparation of
suitable stimulus materials alone, in
order to study a given individual’s
systematic distortions, may be found to
be both difficult and extremely time
consuming. We hope that additional
researchers will be attracted by this
approach to some of these problem areas.

¹⁸ Grateful acknowledgment is made to
Haber and Alpert for making their data
available to us, to Paul Farnsworth for
his cooperation in obtaining our data,
and to the National Science Foundation
Research Grant NSF G-3045 for 2
[illegible] hours of computer time on the
IBM 650.


(Received August 20, 1959)














Psychological Review
1961, Vol. 68, No. 1


APPLICATION OF STIMULUS SAMPLING THEORY TO
SITUATIONS INVOLVING SOCIAL PRESSURE 1

PATRICK SUPPES AND FRANKLIN KRASNE

Stanford University


Much contemporary social psychological theory appears to be theory of group
behavior qua group. While group oriented concepts may be transiently useful
in classifying and systematizing the vast experimental literature of social
psychology, we hold that ultimately group behavior can be explained entirely
in terms of the behavior of the individuals who constitute the group. In
other words we regard the theory of social behavior as a highly important
special case of the general theory of individual behavior. Furthermore, to
our minds recent quantitative formulations of
stimulus-response-reinforcement theory provide excellent conceptual tools for
effecting such a subsumption.

This is not to claim that we are prepared to give a detailed
stimulus-response-reinforcement analysis of every experimental situation now
considered important by social psychologists, but it is to claim that the
general lines of such an analysis are clear and in fact may be given in
detail, as we shall see for a representative example, for a rather large
class of social interaction situations. We conceive of an interaction
situation as one in which each member of a group (potentially) provides
stimuli and reinforcements for every other member of the group with the
behavior of each member entirely explicable on an individual basis given the
sequence of stimuli and reinforcements impinging on him. Examples of small
group experiments analyzed from this point of view are Hays and Bush (1954),
Atkinson and Suppes (1958, 1959), Burke (1959), and Suppes and Atkinson
(1960).

1 This research was supported by the Rockefeller Foundation and the Group
Psychology Branch of the Office of Naval Research. This paper will be
included in the Reprint Series of the Stanford Institute for Mathematical
Studies in the Social Sciences.

Here we wish to consider the theory of social comparison processes, an area
of social psychology usually spoken of in terms of frames of reference,
social pressure, group norms, and the like, whose experimental work is both
especially provocative and particularly well suited to precise
experimentation.

Using strictly the notions of stimulus, response, and reinforcement, it is
natural to construe the kinds of social situations with which we are
concerned as classical or “almost classical” discrimination experiments. The
problem, of course, is to identify in each experiment just what is to be
considered as stimulus, what as response, and (especially difficult) what as
reinforcement. Once the identifications have been made it is easy enough to
assume that a response reinforced in some stimulus situation will have an
increased tendency to be emitted on future occasions of that situation.

In the social situation objective stimuli and the behavior of other members
of a group combine to form the relevant stimulus situations for each subject.
Response classes are in principle arbitrary, and if an experiment is well
structured for the subject, the experimenter will be easily led to a
“natural” classification of responses.







Considerable difficulty arises in 
making reinforcement identifications, 
because subjects bring with them to 
an experiment a large number of cov- 
ert verbal responses having secondary 
reinforcing properties. Consequently, 
we are simply forced to limit the 
number of reinforcers which we wish 
to recognize as important and for 
analytical purposes to ignore the rest. 
We shall suppose first that social 
support per se is reinforcing; although 
admittedly there will frequently be 
factors working against it, we will 
consider them explicitly when they 
seem important. The proprioceptive 
stimuli produced by an overlearned 
response have consistently preceded 
reinforcement, and therefore the mak- 
ing of an overlearned response is self- 
reinforcing via the mechanism of 
secondary reinforcement; thus, we 
shall assume secondly that there is 
some automatic (secondary) reinforce- 
ment for making responses which 
have been well overlearned in every- 
day experience. Lastly, there may 
be experimenter-controlled rewards 
of a less ambiguous nature such as 
money payoffs, prizes, and primary 
drive reduction. 

As a specific example of the type of 
analysis which we are suggesting we 
shall study in detail in the remainder 
of this paper an experimental situa- 
tion which is similar in certain respects 
to the classic experiments of Sherif 
(1935) and in other respects to those 
of Asch (1956). The theoretical 
ideas which have been described will 
be embedded in a stochastic model of 
the general type described by Suppes 
and Atkinson (1960), which is a 
variant of the stimulus sampling 
theory of Estes and Burke. 

In the experiment to be described 
subjects were required to make a 
choice on the basis of an objective but 
slightly ambiguous stimulus situation;
in particular they were asked to
indicate which of two lines they
thought was longer. This choice was 
followed by an indication of what the 
subjects were instructed to believe 
was the correct answer. 

From a social psychological point 
of view the subject plus the experi- 
menter form a dyad which is the 
simplest case of a small group. Ac- 
cording to the social orientation the 
experimenter exerts (social) “pressure”
on the subject to modify his
choices in the direction which the
experimenter calls correct. In this
sense there are certain similarities to 
Asch’s (1956) study; however, since 
the subject does not find out what the 
experimenter considers as correct until 
after his own response has been made, 
one might suppose, following Sherif 
(1935), that the effect of the social 
pressure imposed by the experimenter 
is to modify the subject’s frame of 
reference on future stimulus pres- 
entations. 

Our approach to the problem will 
be to treat the experimental situation 
as a stimulus discrimination experi- 
ment. The situation is complicated 
slightly in that subjects have strongly 
overlearned a relevant visual discrimi- 
nation prior to this experiment; we 
acknowledge the strong effect of this 
past learning by positing a secondary 
reinforcer which may operate to ac- 
centuate or attenuate the effect of 
the experimenter-controlled reinforcer 
of social support. 

It is the conflict between past learn- 
ing and the experimenter-controlled 
reinforcer which generates social pres- 
sure that does not exist in the usual 
discrimination experiment. The sub- 
ject holds Opinion A (here interpreted 
as a response probability) and another 
person (in this case the experimenter), 
who has for one reason or another the 
power to reinforce the subject, holds 
B, a potentially different opinion. 
That the person giving the second response is the experimenter is
irrelevant. It could equally well have been another subject, and by the same
token we could easily pass from the dyad to a larger group. This would
involve a more laborious but strictly analogous treatment.


THE EXPERIMENT AND ITS THEORY

On each of a sequence of trials, a pair of lines, one slightly longer than
the other, was projected for a few seconds on a screen. The subjects were
asked to record on answer sheets which of the two lines (labeled Line 1 and
Line 2) they thought was longer. They were then told (what they had been
instructed to believe would be) the correct answer; however, it was in
reality correct only on a randomly chosen subset of the trials.

In order to describe the situation more precisely, let us introduce some
notation:


$S_1$ = Event of projecting a line pair of which Line 1 is longer

$S_2$ = Event of projecting a line pair of which Line 2 is longer

$A_1$ = Response of subject on answer sheet indicating that he thinks Line 1
is longer

$A_2$ = Response of subject on answer sheet indicating that he thinks Line 2
is longer

$E_1$ = Reinforcing event of experimenter saying, “Line 1 is longer.”

$E_2$ = Reinforcing event of experimenter saying, “Line 2 is longer.”

$Q_1$ = Secondary reinforcement of $A_1$ response

$Q_2$ = Secondary reinforcement of $A_2$ response

$\pi_1 = P(E_1 \mid S_1)$

$\pi_2 = P(E_2 \mid S_2)$

$\gamma = P(S_1)$

$\delta$ = Probability that a subject makes a correct discrimination in a
control experiment where no information is being given him regarding the
correctness of his responses

We will be primarily concerned with the behavior of various conditional
probabilities of response as a function of $\pi_1$, $\pi_2$, and $\delta$.

On any trial first a stimulus event ($S_1$ or $S_2$), then a response ($A_1$
or $A_2$), and then a reinforcing event ($E_1$ or $E_2$) occurs. In addition
we introduce automatic secondary reinforcing events, $Q_1$ and $Q_2$, which
reinforce the “perceptually correct” response. Thus, on any trial of an
experiment the sequence of events may be indicated by the string of symbols,

$$C \rightarrow S_i \rightarrow A_j \rightarrow Q_i \rightarrow E_k \rightarrow C'$$

where $C$ and $C'$ represent conditioning before and after the trial and
where we say that a response is conditioned to a stimulus event if the
response is elicited as a result of that stimulus event. At all times exactly
one response is conditioned to a particular stimulus event.

Independently of what response was actually made on a trial we will say that
both $E_k$ and $Q_k$ reinforce $A_k$. If a reinforcement is effective, then
the reinforced response becomes conditioned to the stimulus event occurring
on the trial. If no reinforcement is effective, then conditioning remains
unchanged. It will be assumed that exactly one of the following occurs on
every trial: social support is effective, secondary self-reinforcement is
effective, neither is effective. Since these events are mutually exclusive
and exhaustive, we define

$\theta_1 = P(A_i \text{ is effectively reinforced by } Q_i \mid Q_i, E_j)$

$\theta_2 = P(A_j \text{ is effectively reinforced by } E_j \mid Q_i, E_j)$

$1 - \theta_1 - \theta_2 = P(\text{No reinforcement is effective} \mid Q_i, E_j)$
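The trial structure defined above can be illustrated with a small simulation. This is only a sketch under stated assumptions: the function name `run_trial` and its argument layout are hypothetical, the conditioned response is taken to be the one emitted, and exactly one of the three outcomes (secondary self-reinforcement effective, social support effective, neither effective) is drawn per trial with probabilities $\theta_1$, $\theta_2$, and $1 - \theta_1 - \theta_2$.

```python
import random


def run_trial(conditioned, stimulus, pi, theta1, theta2, rng):
    """Simulate one trial on which `stimulus` (1 or 2) is presented.

    conditioned: response (1 or 2) currently conditioned to this stimulus.
    pi: P(E_i | S_i) for the presented stimulus (pi_1 on S_1, pi_2 on S_2).
    Returns (response, new_conditioned).
    """
    response = conditioned            # the conditioned response is emitted
    q = stimulus                      # Q_i reinforces the perceptually correct A_i
    # The experimenter announces E_i with probability pi, otherwise the other line.
    e = stimulus if rng.random() < pi else 3 - stimulus
    u = rng.random()
    if u < theta1:                    # secondary self-reinforcement is effective
        new_conditioned = q
    elif u < theta1 + theta2:         # social support is effective
        new_conditioned = e
    else:                             # no reinforcement is effective
        new_conditioned = conditioned
    return response, new_conditioned


if __name__ == "__main__":
    rng = random.Random(0)
    state = 2                         # start with A_2 conditioned to S_1
    for _ in range(5):
        resp, state = run_trial(state, 1, pi=0.7, theta1=0.6, theta2=0.2, rng=rng)
        print(resp, state)
```

Run repeatedly from a fixed starting state, the relative frequency of each conditioning change approximates the corresponding transition probability of the chain.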

Consider the subsequence of trials on which $S_1$ and $S_2$, respectively,
occur. Since the conditioning of a response to a stimulus event can be
affected only if that stimulus event occurs, we can treat these two
subsequences as separate and independent simple learning situations (simple
learning as opposed to discrimination learning). In particular it is easily
seen that each learning process is characterized under our assumptions as a
Markov chain with $A_1$ and $A_2$ as its states (that is to say on any trial
either $A_1$ or $A_2$ is conditioned to $S_i$).

As an example of the method used in deriving the transition matrices for
these chains, we shall analyze in terms of a tree of logical possibilities a
trial on which $S_1$ occurred. If the subject ended the last $S_1$ trial
conditioned to $A_1$, then the relevant tree is

    {$A_1$ conditioned to $S_1$} -> $S_1$ -> $A_1$ -> $Q_1$ ->
        $E_1$ (probability $\pi_1$):
            $\theta_1$:                   $A_1$ conditioned to $S_1$
            $\theta_2$:                   $A_1$ conditioned to $S_1$
            $1 - \theta_1 - \theta_2$:    $A_1$ conditioned to $S_1$
        $E_2$ (probability $1 - \pi_1$):
            $\theta_1$:                   $A_1$ conditioned to $S_1$
            $\theta_2$:                   $A_2$ conditioned to $S_1$
            $1 - \theta_1 - \theta_2$:    $A_1$ conditioned to $S_1$

and if the subject ended the last $S_1$ trial with $A_2$ conditioned to $S_1$
we have

    {$A_2$ conditioned to $S_1$} -> $S_1$ -> $A_2$ -> $Q_1$ ->
        $E_1$ (probability $\pi_1$):
            $\theta_1$:                   $A_1$ conditioned to $S_1$
            $\theta_2$:                   $A_1$ conditioned to $S_1$
            $1 - \theta_1 - \theta_2$:    $A_2$ conditioned to $S_1$
        $E_2$ (probability $1 - \pi_1$):
            $\theta_1$:                   $A_1$ conditioned to $S_1$
            $\theta_2$:                   $A_2$ conditioned to $S_1$
            $1 - \theta_1 - \theta_2$:    $A_2$ conditioned to $S_1$

Using similar methods we arrive at the following Markov chains for the $S_1$
and $S_2$ learning processes, respectively. On $S_1$ type trials we identify
two states, 1 and 2, referring to $A_1$ and $A_2$, respectively, as being
conditioned to $S_1$; this gives the transition matrix

$$\begin{bmatrix} 1 - (1 - \pi_1)\theta_2 & (1 - \pi_1)\theta_2 \\ \theta_1 + \pi_1\theta_2 & 1 - \theta_1 - \pi_1\theta_2 \end{bmatrix} \qquad [1]$$

Similarly on $S_2$ trials, identifying states 1 and 2 as referring to $A_1$
and $A_2$, respectively, being conditioned to $S_2$, the transition matrix is

$$\begin{bmatrix} 1 - \theta_1 - \pi_2\theta_2 & \theta_1 + \pi_2\theta_2 \\ (1 - \pi_2)\theta_2 & 1 - (1 - \pi_2)\theta_2 \end{bmatrix} \qquad [2]$$
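As a check on the two transition matrices, the following sketch builds them from the branch probabilities of the trees and computes the long-run probability of state 1 for a two-state chain. The function names are hypothetical; the stationary formula $p_{21}/(p_{12} + p_{21})$ is the standard result for a two-state Markov chain, not something stated in the text.

```python
def transition_matrix(stimulus, pi, theta1, theta2):
    """Two-state chain for the S_1 (stimulus=1) or S_2 (stimulus=2) subsequence.

    State 1 means A_1 is conditioned to the stimulus, state 2 means A_2 is.
    pi is pi_1 on S_1 trials and pi_2 on S_2 trials.
    """
    if stimulus == 1:
        p12 = (1 - pi) * theta2          # E_2 given and social support effective
        p21 = theta1 + pi * theta2       # Q_1 effective, or E_1 given and effective
    else:
        p12 = theta1 + pi * theta2       # Q_2 effective, or E_2 given and effective
        p21 = (1 - pi) * theta2          # E_1 given and social support effective
    return [[1 - p12, p12], [p21, 1 - p21]]


def stationary_p1(matrix):
    """Long-run probability of state 1 for a two-state chain."""
    p12, p21 = matrix[0][1], matrix[1][0]
    return p21 / (p12 + p21)


if __name__ == "__main__":
    m1 = transition_matrix(1, pi=0.7, theta1=0.6, theta2=0.2)
    m2 = transition_matrix(2, pi=0.2, theta1=0.6, theta2=0.2)
    print(m1, stationary_p1(m1))
    print(m2, stationary_p1(m2))
```

For the $S_1$ chain the stationary value reduces to $(\theta_1 + \pi_1\theta_2)/(\theta_1 + \theta_2)$, and for the $S_2$ chain to $(1 - \pi_2)\theta_2/(\theta_1 + \theta_2)$.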


Using the fact that the sequence of $S_1$'s and $S_2$'s occurs in accordance
with a binomial distribution with parameter $\gamma$, we may combine the
above processes to obtain $P_n(A_1 \mid S_1)$ and $P_n(A_1 \mid S_2)$,
representing the probabilities of $A_1$ responses on the $n$th trial of the
full experiment given $S_1$ and $S_2$ stimulus events, respectively:

$$P_n(A_1 \mid S_1) = \frac{\theta_1/\theta_2 + \pi_1}{\theta_1/\theta_2 + 1} + \left\{ \delta - \frac{\theta_1/\theta_2 + \pi_1}{\theta_1/\theta_2 + 1} \right\} \left[ 1 - \gamma(\theta_1 + \theta_2) \right]^{n-1} \qquad [3]$$

$$P_n(A_1 \mid S_2) = \frac{1 - \pi_2}{\theta_1/\theta_2 + 1} + \left\{ (1 - \delta) - \frac{1 - \pi_2}{\theta_1/\theta_2 + 1} \right\} \left[ 1 - (1 - \gamma)(\theta_1 + \theta_2) \right]^{n-1} \qquad [4]$$

$$n = 1, 2, \cdots$$

The psychophysical constants $\delta$ and $(1 - \delta)$ have been taken as
the initial probabilities of $A_1$ responses on $S_1$ and $S_2$ type trials,
respectively. It is clear that since $0 < \theta_1 + \theta_2 < 1$, the right
hand terms of each expression vanish for large $n$, and hence, the asymptotic
results are given by the first terms to the right of the equality signs. We
have graphed the asymptotic quantities as a function of the ratio
$\theta_1/\theta_2$ for relevant values of $\pi_1$ and $\pi_2$ in Figure 1.
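The learning curves of Equations 3 and 4 can be evaluated directly. The sketch below assumes the forms $(\theta_1 + \pi_1\theta_2)/(\theta_1 + \theta_2)$ and $(1 - \pi_2)\theta_2/(\theta_1 + \theta_2)$ for the asymptotes, which are algebraically the same as the ratio forms; the function name `learning_curve` is hypothetical.

```python
def learning_curve(n, stimulus, pi, theta1, theta2, delta, gamma):
    """P_n(A_1 | S_stimulus) on trial n = 1, 2, ... of the full experiment.

    pi is pi_1 for stimulus 1 and pi_2 for stimulus 2; gamma = P(S_1).
    """
    total = theta1 + theta2
    if stimulus == 1:
        asymptote = (theta1 + pi * theta2) / total
        start = delta                       # initial P(A_1 | S_1)
        rate = 1 - gamma * total
    else:
        asymptote = (1 - pi) * theta2 / total
        start = 1 - delta                   # initial P(A_1 | S_2)
        rate = 1 - (1 - gamma) * total
    return asymptote + (start - asymptote) * rate ** (n - 1)


if __name__ == "__main__":
    for n in (1, 2, 5, 10, 50):
        print(n, learning_curve(n, 1, pi=0.7, theta1=0.6, theta2=0.2,
                                delta=0.92, gamma=0.5))
```

The geometric factor reflects the fact that only a fraction $\gamma$ (or $1 - \gamma$) of the trials can change the conditioning attached to a given stimulus, so the per-trial rate of approach is scaled accordingly.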


Fig. 1. Asymptotic behavior of $P(A_1 \mid S_1)$ and $P(A_1 \mid S_2)$ as a
function of the ratio $\theta_1/\theta_2$.


METHOD

Stimuli

The stimuli for these experiments were
pairs of parallel lines projected on a field 46
inches wide by 32 inches high; the lines were
oriented so that a line passing through their
midpoints would be perpendicular to each.
Subjects were run under two different stimu- 
lus conditions, which we shall designate the 
“hard” and “easy” conditions to refer to their 
involving hard and easy discriminations, re- 
spectively. Within each difficulty condition 
(hard or easy) there were two types of slides 
used (A and B in Table 1), and the orienta- 
tion of the line pairs was permuted so as to 
produce 2 X 2 X 2 = 8 different line pairs 
(e.g., lines vertical, lines horizontal, long line 
on left, long line on right, etc.). The lines 
were labeled 1 and 2, respectively, for identifi- 
cation by the subjects. In order to avoid the 
effect of position habits on the data, for half 
the subjects the upper horizontal and left 
vertical lines were always labeled 1 and for the 
other half the bottom and right lines were 
called 1. 

In Table 1, which gives a description of the line pairs, a is the distance
between the lines of the pair; b is the half-length of the shorter line; c is
the difference in half-lengths between the longer and shorter lines;
$\delta$ is the probability of the subject making a correct discrimination
when no experimenter controlled reinforcing events are influencing his
behavior.


TABLE 1

DESCRIPTION OF STIMULI

Slide              Hard                              Easy
type       a       b       c     delta       a       b       c     delta
A        15.5    5.12    .240     .72      15.5    2.56    .310     .92
B        15.5    7.75    .250     .72      15.5    7.75    .340     .92

Note. a, b, and c are measured in inches.


TABLE 2

GROUP DESCRIPTIONS

Group
Number    pi_1    pi_2    delta    Description
I          .7      .2      .92     Easy, low symmetry
II         .7      .2      .72     Hard, low symmetry
III        .7      .6      .72     Hard, medium symmetry


Room and Apparatus

The line pairs were projected by an Argus 300 Projector with a Sylvania
300-watt projector lamp on to a beaded screen 146 inches away. The subjects
(from one to five in number) were seated at a distance of about 96 inches
from the front of the screen. The room illumination was about .3 Weston II
units.

The subjects responded on answer sheets prepared in essentially the same way
as standard IBM score sheets. For half the subjects the left column was used
to indicate an $A_1$ response and for half the right.


Experimental Groups 


Three groups of subjects whose conditions 
are described in Table 2 were run. The degree 
of symmetry under “Description” refers to 
relative similarity of reinforcing situations on
$S_1$ and $S_2$ type trials. For all groups $\gamma = .5$.


Subjects 


The subjects, who were students at Stanford 
University, were obtained from the student 
employment service, an introductory psychol- 
ogy class, and a university dormitory. With 
the exception of a few of those from the psy- 
chology class all subjects were male. There 
were 26, 25, and 18 subjects in Groups I, II, 
and III, respectively. Because of certain 
technical difficulties in preparing the stimuli, 
subjects were not placed randomly in groups. 


Procedure 


There were from one to five subjects per 
experimental session. The subjects, having 
been seated and given answer sheets, were 
given the following instructions. (A sample
slide was projected on the screen throughout
the instructions.)


This is an experiment on judgment of 
length. I am going to flash a number of 
slides on the screen each of which has two 
lines on it; your job in each case will be to 
decide which of the two lines is the longer 
and to mark your judgment on the answer 
sheet in the appropriate column opposite 
the number of the slide we are on. For 
example: if on the first slide I showed, the 
line next to the figure one on the screen was 
longer, you would mark the answer sheet 
like this [show marked answer sheet]. 
Sometimes the pair of lines will be vertical 
rather than horizontal; then the line here 
would be line one and the one here line 
two [point to screen to illustrate what I am 
saying]. Please mark the answer sheets 
heavily and completely as though you were 
filling in an IBM score sheet. I will an- 
nounce the number of the slide we are on 
before each judgment and will tell you the 
correct answer after each judgment. You 
must make your decisions quickly and re- 
cord them BEFORE I tell you the answer. 
Each slide will appear for about 2 seconds. 
One of the two lines will always be longer 
than the other; however, some of the judg- 
ments may be difficult. If you are not 
sure which line is longer, then guess. Since 
I want you to work completely independ- 
ently of each other, I must request that you 
remain absolutely silent during the experi- 
ment. Are there any questions? 


The trials then proceeded at a rate of about seven per minute. Each stimulus
presentation lasted for approximately 2 seconds. Occasionally the
experimenter found it necessary to repeat early in the sessions, “This is a
test of sorts; you must remain completely silent.” Each subject received a
total of 250 trials.

After the experiment the subjects were instructed: “Will you please write a
brief statement of your reactions to this experiment?”
The responses to this question indicated that 
the subjects understood and believed the 
instructions. 


ESTIMATION METHODS 


Two methods will be presented for estimating the parameters $\theta_1$ and
$\theta_2$ from the data of our experiments. First we shall make maximum
likelihood estimates based on the separate $S_1$ and $S_2$ response
subsequences (Anderson & Goodman, 1957); then, taking advantage of the mutual
independence of the $S_1$ and $S_2$ learning processes, we shall maximize the
joint likelihood function to obtain single estimates of $\theta_1$ and
$\theta_2$.

It can be shown that the first method is essentially equivalent to setting
the expressions for the theoretical transition probabilities equal to their
estimates and solving for $\theta_1$ and $\theta_2$.

The estimates based on maximizing the joint likelihood function are not quite
so simple. The derivation, which is reasonably straightforward but tedious,
leads to a tenth degree equation for the unknown, $\theta_2$; solutions were
obtained by approximation methods.
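The first estimation method can be sketched for the $S_1$ subsequence as follows. This is an illustration under assumptions, not the authors' procedure in detail: the function name is hypothetical, a single response sequence is used (pooling over subjects is omitted), and the observed transition proportions are simply equated to the entries of Matrix 1, giving $\theta_2 = \hat{p}_{12}/(1 - \pi_1)$ and $\theta_1 = \hat{p}_{21} - \pi_1\theta_2$.

```python
def estimate_thetas_s1(responses, pi1):
    """Estimate (theta1, theta2) from successive responses on S_1 trials.

    responses: sequence of 1s and 2s, the response made on each S_1 trial.
    pi1: P(E_1 | S_1), known from the experimental design (must be < 1).
    """
    counts = {(1, 1): 0, (1, 2): 0, (2, 1): 0, (2, 2): 0}
    for prev, nxt in zip(responses, responses[1:]):
        counts[(prev, nxt)] += 1
    from1 = counts[(1, 1)] + counts[(1, 2)]
    from2 = counts[(2, 1)] + counts[(2, 2)]
    if from1 == 0 or from2 == 0 or pi1 >= 1:
        raise ValueError("both states must be visited and pi1 must be below 1")
    p12_hat = counts[(1, 2)] / from1     # observed proportion of 1 -> 2 moves
    p21_hat = counts[(2, 1)] / from2     # observed proportion of 2 -> 1 moves
    theta2 = p12_hat / (1 - pi1)         # from p12 = (1 - pi1) * theta2
    theta1 = p21_hat - pi1 * theta2      # from p21 = theta1 + pi1 * theta2
    return theta1, theta2


if __name__ == "__main__":
    print(estimate_thetas_s1([1, 1, 1, 1, 2, 2, 1], pi1=0.5))
```

Nothing in this sketch constrains the estimates to lie between 0 and 1, so with sparse data they may fall outside that range.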


RESULTS AND DISCUSSION

Tables 3 and 4 present learning parameter estimates obtained by the two
methods described above.

The most important observation to be made is that for all estimates
$\theta_1/\theta_2$ is less on hard than on easy discriminations. One can go
even further and point out that with the exception of the $S_2$ trial
estimate in Group III this learning parameter ratio is greater than one for
the easy discriminations and less than one for the hard ones. This says
essentially (for our experimental situation) that if the physical stimulus
situation is fairly unambiguous, subjects will “prefer” to derive their
reinforcement from it via $Q_1$ and $Q_2$, while if it is not, they will
depend primarily on social support. This makes good intuitive sense and in
fact has been adopted as a postulate by Festinger (1954) in his theory of
social comparison processes.

In spite of the fact that we get these intergroup differences for all
estimates, it is interesting and rather odd that, as will be seen by studying
Table 3, the effects described above show up much more markedly on $S_1$ than
on $S_2$ type trials. Since the actual stimulus and response labeling was
randomized over subjects, and since the stimuli and responses were physically
symmetrical, the relation between $\pi_1$ and $\pi_2$ must be responsible for
this rather surprising phenomenon.


TABLE 3

SEPARATE MAXIMUM LIKELIHOOD ESTIMATES OF theta_1, theta_2, AND DERIVED
QUANTITIES

Trials on Which
Estimates Based    Quantity             Group I    Group II    Group III
S_1                theta_1                .60        .38         .30
                   theta_2                .20        .54         .62
                   theta_1/theta_2       3.0         .71         .48
                   theta_1 + theta_2      .78        .93         .91
S_2                theta_1                .42        .36         .48
                   theta_2                .27        .36         .45
                   theta_1/theta_2       1.6        1.0         1.1
                   theta_1 + theta_2      .69        .72         .92

Reference to the tables under discussion will show that the over-all
effectiveness of reinforcement, $\theta_1 + \theta_2$, increased from the
easy to the hard conditions. One might argue that the harder a
discrimination, the more attentive the subject will be to all events
surrounding him; thus, reinforcements are generally more effective when
$\delta$ is small. Although it is true that via Equations 3 and 4 one can
estimate the ratio $\theta_1/\theta_2$ directly from asymptotic data, we have
preferred to make much more efficient estimates of this ratio by making
strong use of the Markov properties of response subsequences. Specifically,
the ratio $\hat{\theta}_1/\hat{\theta}_2$, where $\hat{\theta}_1$ and
$\hat{\theta}_2$ are maximum likelihood estimates, yields the maximum
likelihood estimate of $\theta_1/\theta_2$. Consequently, we wish to
emphasize that the extent of agreement between observed asymptotic behavior
and the predicted asymptotes using the maximum likelihood estimates of
$\theta_1$ and $\theta_2$ is an empirical question representing one sort of
test of the model. The predicted asymptotic conditional response
probabilities have been included in Table 5, which also presents for purposes
of comparison the relevant relative frequencies over the last 50 and 100
trials for each group. Although the fit is not perfect, the observed and
predicted rank orders across groups are in agreement for the predictions
based on the joint maximum likelihood estimates.


TABLE 4

JOINT MAXIMUM LIKELIHOOD ESTIMATES OF theta_1, theta_2, AND DERIVED
QUANTITIES

Quantity             Group I    Group II    Group III
theta_1                .44        .39         .40
theta_2                .25        .42         .52
theta_1/theta_2       1.8         .92         .77
theta_1 + theta_2      .69        .81         .92
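The predicted asymptotes discussed above can be recomputed from the joint estimates of Table 4 and the design values of Table 2. The sketch below is illustrative only: the dictionary layout and function name are hypothetical, and because the tabled estimates are rounded to two decimals the results may differ from the published predictions in the second decimal place.

```python
# Design values (Table 2) and joint maximum likelihood estimates (Table 4).
GROUPS = {
    "I":   {"pi1": 0.7, "pi2": 0.2, "theta1": 0.44, "theta2": 0.25},
    "II":  {"pi1": 0.7, "pi2": 0.2, "theta1": 0.39, "theta2": 0.42},
    "III": {"pi1": 0.7, "pi2": 0.6, "theta1": 0.40, "theta2": 0.52},
}


def predicted_asymptotes(pi1, pi2, theta1, theta2):
    """Return the asymptotic (P(A_1 | S_1), P(A_1 | S_2)) of Equations 3 and 4."""
    total = theta1 + theta2
    p_a1_s1 = (theta1 + pi1 * theta2) / total
    p_a1_s2 = (1 - pi2) * theta2 / total
    return p_a1_s1, p_a1_s2


if __name__ == "__main__":
    for name, params in GROUPS.items():
        s1, s2 = predicted_asymptotes(**params)
        print(f"Group {name}: P(A1|S1) = {s1:.2f}, P(A1|S2) = {s2:.2f}")
```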


model. 


Observed mean learning curves on which the predicted asymptotes (of the last
line of Table 5) are indicated as dashed lines are plotted in Figure 2. In
order to determine whether a significant amount of learning occurred, the
conditional response proportions from the first and last 50 trials of each
experiment were compared by two-tailed sign tests (Siegel, 1956); the
significance levels $\alpha$ are shown next to the appropriate curves.

The sign test, which was used here because the distribution assumptions
necessary for it are almost negligible, does not produce acceptable
significance levels. Higher level ($\alpha < .01$) tests may be obtained if
we are willing to make the rather strong assumption that the individual
responses of a particular subject are independent Bernoulli events. However,
this assumption is too grossly violated to allow one to place much confidence
in results based upon it.

If we draw learning curves (Equations 3 and 4) using the maximum likelihood
estimates of $\theta_1$ and $\theta_2$, we find that the curves rise to their
asymptotes much more rapidly than do the observed curves of Figure 2. We wish
to point out that this does not represent a fundamental problem for the
present theory although it limits the usefulness of the specific form of the
model which we have presented. The problem arises from our having assumed
that only one stimulus “element” or “pattern” (see Estes, 1959) was
associated with each stimulus event, $S_i$. By positing a larger number of
such stimulus patterns and retaining the supposition that exactly one is
sampled on each trial, the theoretical learning rate may be decreased at the
expense of considerably more tedious computations.


TABLE 5

OBSERVED AND PREDICTED ASYMPTOTIC CONDITIONAL RESPONSE PROBABILITIES

                                                  Group II
                                                  P(A_1|S_2)
Observed: Over last 100 trials                       .40
Observed: Over last 50 trials                        .39
Theoretical: Based on S_1 trial estimates            .48
Theoretical: Based on S_2 trial estimates            .40
Theoretical: Based on joint MLE                      .42

[The remaining columns of Table 5 are not recoverable from this copy.]


Fig. 2. Mean learning curves. (Blocks of 25 trials.)
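For readers unfamiliar with the two-tailed sign test mentioned in the discussion of Figure 2, a minimal sketch follows. It assumes the standard exact binomial form of the test (ties dropped, null probability one half, tail probability doubled); the function name and the paired-list interface are hypothetical and are not taken from Siegel (1956).

```python
from math import comb


def sign_test_two_tailed(first, last):
    """Exact two-tailed sign test for paired observations.

    first, last: equal-length sequences, e.g. each subject's response
    proportion over the first and over the last 50 trials.
    Returns the two-tailed probability under the null hypothesis that
    increases and decreases are equally likely.
    """
    diffs = [b - a for a, b in zip(first, last) if b != a]   # ties are dropped
    n = len(diffs)
    if n == 0:
        return 1.0
    positives = sum(1 for d in diffs if d > 0)
    k = min(positives, n - positives)                        # rarer sign count
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n    # one-tailed P(X <= k)
    return min(1.0, 2 * tail)


if __name__ == "__main__":
    early = [0.5] * 10
    late = [0.6] * 9 + [0.4]
    print(sign_test_two_tailed(early, late))
```

Because the test uses only the direction of each change, it needs no assumption about the distribution of the proportions themselves, which is the property the text appeals to.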

We may take a compromise approach, however. If we suppose there to be $N$
stimulus elements associated with each stimulus situation, then although most
formulae become considerably complicated, the learning curves 3 and 4 are
changed only in the last term; let $\theta_1/\theta_2 = \epsilon$, then we
may write

$$P_n(A_1 \mid S_1) = \frac{\epsilon + \pi_1}{\epsilon + 1} + \left\{ \delta - \frac{\epsilon + \pi_1}{\epsilon + 1} \right\} \left[ 1 - \gamma(\epsilon + 1)\frac{\theta_2}{N} \right]^{n-1}$$

$$P_n(A_1 \mid S_2) = \frac{1 - \pi_2}{\epsilon + 1} + \left\{ (1 - \delta) - \frac{1 - \pi_2}{\epsilon + 1} \right\} \left[ 1 - (1 - \gamma)(\epsilon + 1)\frac{\theta_2}{N} \right]^{n-1}$$

We may estimate $\epsilon$ directly from asymptotic data and choose
$\theta_2/N$ to make a least squares fit of the theoretical to the empirical
learning curves.
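The least squares step can be sketched as a one-parameter grid search. This is an illustration only: it assumes that the rate term of the $N$-element curve is $1 - \gamma(\epsilon + 1)\theta_2/N$, treats $\theta_2/N$ as the single free parameter, and uses hypothetical function names; the source does not say how the fit was actually carried out.

```python
def curve_s1(n, eps, pi1, delta, gamma, theta2_over_n):
    """N-element learning curve P_n(A_1 | S_1), with eps = theta1/theta2."""
    asymptote = (eps + pi1) / (eps + 1)
    rate = 1 - gamma * (eps + 1) * theta2_over_n
    return asymptote + (delta - asymptote) * rate ** (n - 1)


def fit_theta2_over_n(observed, eps, pi1, delta, gamma, steps=1000):
    """Grid-search the value of theta2/N minimizing squared error.

    observed: list of (trial_number, observed_proportion) pairs.
    The grid covers (0, 1/(eps + 1)], which keeps (eps + 1) * theta2/N <= 1.
    """
    upper = 1 / (eps + 1)
    best_value, best_sse = None, None
    for i in range(1, steps + 1):
        candidate = upper * i / steps
        sse = sum((curve_s1(n, eps, pi1, delta, gamma, candidate) - y) ** 2
                  for n, y in observed)
        if best_sse is None or sse < best_sse:
            best_value, best_sse = candidate, sse
    return best_value


if __name__ == "__main__":
    # Synthetic "observations" generated from a known value, for illustration.
    data = [(n, curve_s1(n, 1.0, 0.7, 0.72, 0.5, 0.02)) for n in range(1, 251, 25)]
    print(fit_theta2_over_n(data, eps=1.0, pi1=0.7, delta=0.72, gamma=0.5))
```

A grid search is used here only because it is transparent; any one-dimensional minimizer would serve the same purpose.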


CONCLUSIONS 


In conclusion it is our opinion that considerable analytic advantage has
derived from application of a quantitative theory of individual behavior in
the present context. Through its use we have been able to make highly
specific remarks about the extent of operation of certain events which are
not directly observable but which may reasonably be postulated to exist,
namely secondary self-reinforcement of responses well established in everyday
experience. Also we have been able to compare the magnitude of the effects of
these secondary reinforcements to that of the more directly controlled social
support. As far as we can see, it would be considerably more difficult to
obtain comparable information from experimentation not oriented about a
quantitative theory.

We wish to emphasize again that the central ideas behind the present analysis
are ones of considerable generality; similar techniques of analysis can be
applied to many social psychological situations by a fairly straightforward
extension of the ideas presented here.


REFERENCES 


Anderson, T. W., & Goodman, L. A. Statistical inference about Markov chains.
Ann. math. Statist., 1957, 28, 89-109.

Asch, S. E. Studies of independence and submission to group pressure: I. A
minority of one against a unanimous majority. Psychol. Monogr., 1956, 70(9,
Whole No. 416).

Atkinson, R. C., & Suppes, P. An analysis of two-person game situations in
terms of statistical learning theory. J. exp. Psychol., 1958, 55, 369-378.

Atkinson, R. C., & Suppes, P. Application of a Markov model to two-person
noncooperative games. In R. R. Bush & W. K. Estes (Eds.), Studies in
mathematical learning theory. Stanford: Stanford Univer. Press, 1959. Ch. 3.

Burke, C. J. Applications of a linear model to two-person interactions. In
R. R. Bush & W. K. Estes (Eds.), Studies in mathematical learning theory.
Stanford: Stanford Univer. Press, 1959. Ch. 9.

Bush, R. R., & Estes, W. K. (Eds.), Studies in mathematical learning theory.
Stanford: Stanford Univer. Press, 1959.

Estes, W. K. Component and pattern models with Markovian interpretations. In
R. R. Bush & W. K. Estes (Eds.), Studies in mathematical learning theory.
Stanford: Stanford Univer. Press, 1959. Ch. 1.

Festinger, L. A theory of social comparison processes. Hum. Relat., 1954, 7,
117-140.

Hays, D. G., & Bush, R. R. A study of group action. Amer. sociol. Rev., 1954,
19, 693-701.

Sherif, M. A study in some social factors in perception. Arch. Psychol.,
1935, No. 187.

Siegel, S. Nonparametric statistics for the behavioral sciences. New York:
McGraw-Hill, 1956.

Suppes, P., & Atkinson, R. C. Markov learning models for multiperson
interactions. Stanford: Stanford Univer. Press, 1960.


(Received October 5, 1959) 


APPENDIX A 


The model (Model I) originally pro- 
posed for the experimental situation 
studied in the body of this paper differed
in its reinforcement mechanisms from 
the one (Model II) actually discussed 
there. Since rather extensive work was 
done on it and since we consider some 
of its failings to be instructive, we pre- 
sent here some detailed comments on 
Model I. 

Let us call those trials on which the
subscripts of $E_i$ and $S_j$ do not agree
“conflict trials.” Rather than introducing
secondary reinforcers,
we assume that the reinforcing event $E_i$
is effective with probability $\theta_A$ on
nonconflict trials and with probability
$\theta_B < \theta_A$ on conflict trials. While we
make no explicit psychological assumptions
regarding the reasons for attenuating
the learning parameter on conflict
trials, the intuitive reasons are clear.

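Markov process state identifications
are exactly the same as in Model II, and
we here present those results for Model I
which correspond to those indicated for
Model II in the body of the paper. On
$S_1$ trials the transition matrix is

$$\begin{bmatrix} 1-(1-\pi_1)\theta_B & (1-\pi_1)\theta_B \\ \pi_1\theta_A & 1-\pi_1\theta_A \end{bmatrix} \qquad \text{[A1]}$$

and on $S_2$ trials it is

$$\begin{bmatrix} 1-\pi_2\theta_A & \pi_2\theta_A \\ (1-\pi_2)\theta_B & 1-(1-\pi_2)\theta_B \end{bmatrix} \qquad \text{[A2]}$$

The learning curves are

$$P_n(A_1 \mid S_1) = \frac{\pi_1}{\pi_1 + \frac{\theta_B}{\theta_A}(1-\pi_1)} - \left[\frac{\pi_1}{\pi_1 + \frac{\theta_B}{\theta_A}(1-\pi_1)} - P_1(A_1 \mid S_1)\right] \left\{1 - \gamma\left[\theta_B(1-\pi_1) + \theta_A\pi_1\right]\right\}^{n-1} \qquad \text{[A3]}$$

$$P_n(A_1 \mid S_2) = \frac{1-\pi_2}{\frac{\theta_A}{\theta_B}\pi_2 + (1-\pi_2)} + \left[P_1(A_1 \mid S_2) - \frac{1-\pi_2}{\frac{\theta_A}{\theta_B}\pi_2 + (1-\pi_2)}\right] \left\{1 - (1-\gamma)\left[\theta_B(1-\pi_2) + \theta_A\pi_2\right]\right\}^{n-1} \qquad \text{[A4]}$$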
















The graphs of the asymptotes as a
function of $\theta_A/\theta_B$, which are presented
in Figure A1, should be compared with
the corresponding ones for Model II.

Several methods of learning parameter
estimation were used for this model, and
each led to the same rather disquieting
result: for all methods the estimates of
$\theta_A$ were greater than one. Before considering
the reason for this anomaly, let
us consider the various methods of estimation
from which we obtained estimates
of $\theta_A$ and $\theta_B$.

Method 1. This method is precisely
the same in principle as the first method
presented in the body of the paper for
$\theta_1$ and $\theta_2$. It is a true maximum likelihood
estimate on each of the two
response subsequences. As in Model II,
the method is, for a fairly large number
of counted transitions per subject, equivalent
to setting the observed transition
matrices equal to the theoretical ones and
solving. A comparison of the transition
matrices for the two models shows that
using this estimation procedure $\theta_2 = \theta_B$.
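
As a minimal sketch of the "set observed equal to theoretical and solve" step of Method 1 for Model I, the following Python function uses the off-diagonal entries of the $S_1$ transition matrix in Equation A1. The function name, the transition counts, and the value of $\pi_1$ are illustrative assumptions and are not taken from the experiment.

```python
def method1_estimates_s1(counts, pi1):
    """Equate an observed S1 transition matrix to Equation A1 and solve.

    counts[i][j] is a hypothetical number of observed transitions from
    response A_(i+1) to response A_(j+1) on successive S1 trials.
    Theoretical off-diagonal entries of Equation A1:
        P(A1 -> A2) = (1 - pi1) * theta_B
        P(A2 -> A1) = pi1 * theta_A
    """
    row1 = counts[0][0] + counts[0][1]
    row2 = counts[1][0] + counts[1][1]
    p12 = counts[0][1] / row1          # observed A1 -> A2 proportion
    p21 = counts[1][0] / row2          # observed A2 -> A1 proportion
    theta_b = p12 / (1.0 - pi1)
    theta_a = p21 / pi1
    return theta_a, theta_b


# Made-up counts chosen so that the estimate of theta_A exceeds one.
theta_a, theta_b = method1_estimates_s1([[80, 20], [45, 5]], pi1=0.6)
print(theta_a, theta_b)
```

Nothing in this solution keeps the estimates inside the unit interval: whenever the observed proportion of $A_2 \to A_1$ transitions exceeds $\pi_1$, the estimate of $\theta_A$ is greater than one.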

Method 2. The joint maximum likelihood
estimate here, which is considerably
simpler than in Model II, may be obtained
explicitly by solving a second
degree equation.

Method 3. The next procedure, which
is rather complicated in the present
experiments, we consider especially interesting,
because it seems to yield many
of the advantages of maximum likelihood
estimation in situations where true
maximum likelihood estimation is unfeasible.

FIG. A1. Asymptotic behavior of $P(A_1 \mid S_1)$
and $P(A_1 \mid S_2)$ as a function of the ratio $\theta_A/\theta_B$
in Model I.

Consider the full sequence of responses
($S_1$ and $S_2$ type trials in the same
sequence now) for a particular subject.
It is possible to compute for large $n$

$$\lambda_{ij} = P_\infty(A_j \text{ on Trial } n \mid A_i \text{ on Trial } n-1)$$

independently of the trial number, $n$.
Thus, while it is true that the sequence of
responses for a subject is a chain of infinite
order, we may as a first approximation
treat it as a stationary Markov chain
with transition probabilities, $\lambda_{ij}$, and
compute under this assumption the
maximum likelihood estimate of the
parameter pair $(\theta_A, \theta_B)$; such a procedure
is an example of pseudo maximum likelihood
estimation. The pseudo likelihood
function is

$$L(\theta_A, \theta_B) = \prod_{i,j} \lambda_{ij}^{n_{ij}} \prod_{i} P_\infty^{n_i}(A_i)$$

where $P_\infty(A_i)$ is taken asymptotically,
$n_i$ is the number of occurrences of response
$A_i$ on the first trial of the sequence
over which the pseudo maximum likelihood
estimate is being taken, and $n_{ij}$ is
the observed number of transitions from
response $A_i$ to $A_j$ over the sequence in
question.
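
The pseudo likelihood can be evaluated directly once the stationary transition probabilities are given. The sketch below computes its logarithm for one subject's response sequence; the function name, the 0/1 response coding, and the numerical values are assumptions made for illustration. The mapping from $(\theta_A, \theta_B)$ to the $\lambda_{ij}$ is not reproduced here, so the transition probabilities are simply passed in.

```python
import math


def pseudo_log_likelihood(responses, lam, p_inf):
    """Log of the pseudo likelihood for a single response sequence.

    responses: list of codes, 0 for A1 and 1 for A2 (hypothetical coding).
    lam[i][j]: assumed stationary probability of A_(j+1) on trial n
               given A_(i+1) on trial n - 1.
    p_inf[i]:  assumed asymptotic probability of A_(i+1), used for the
               first trial of the sequence.
    """
    log_l = math.log(p_inf[responses[0]])
    # Each adjacent pair contributes one observed transition n_ij.
    for prev, cur in zip(responses, responses[1:]):
        log_l += math.log(lam[prev][cur])
    return log_l


# Illustrative values only.
value = pseudo_log_likelihood(
    [0, 0, 1, 0],
    lam=[[0.7, 0.3], [0.4, 0.6]],
    p_inf=[0.5, 0.5],
)
print(value)
```

A pseudo maximum likelihood estimate would then be obtained by expressing `lam` and `p_inf` as functions of the parameter pair and maximizing this quantity over the pair.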
Method 4. One simple estimate of
$\theta_A/\theta_B$ was used in order to provide a
check on the above procedures. Let
$\hat{P}_\infty(A_1 \mid S_1)$ and $\hat{P}_\infty(A_1 \mid S_2)$ be the estimated
(over the last 50 trials) asymptotic
conditional probabilities of response.
Then the value of $\theta_A/\theta_B$ which minimizes

$$\left[\hat{P}_\infty(A_1 \mid S_1) - \frac{\pi_1}{\pi_1 + \frac{\theta_B}{\theta_A}(1-\pi_1)}\right]^2 + \left[\hat{P}_\infty(A_1 \mid S_2) - \frac{1-\pi_2}{\frac{\theta_A}{\theta_B}\pi_2 + (1-\pi_2)}\right]^2$$

is a rather natural estimate of $\theta_A/\theta_B$,
although it uses very little of the information
in the data.
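
A check of the kind described in Method 4 can be sketched as a one-dimensional search over the ratio $\theta_A/\theta_B$. The code below assumes a summed squared discrepancy between the observed and theoretical Model I asymptotes and uses a plain grid search; the criterion, the search range, the function names, and the example values are all assumptions made for illustration.

```python
def asymptotes(ratio, pi1, pi2):
    """Theoretical Model I asymptotes for ratio = theta_A / theta_B."""
    p_s1 = pi1 / (pi1 + (1.0 - pi1) / ratio)
    p_s2 = (1.0 - pi2) / (ratio * pi2 + (1.0 - pi2))
    return p_s1, p_s2


def method4_ratio(p_hat_s1, p_hat_s2, pi1, pi2, lo=0.01, hi=100.0, steps=100000):
    """Grid search for the ratio with the smallest squared discrepancy."""
    best_ratio, best_loss = lo, float("inf")
    for k in range(steps + 1):
        ratio = lo + (hi - lo) * k / steps
        t1, t2 = asymptotes(ratio, pi1, pi2)
        loss = (p_hat_s1 - t1) ** 2 + (p_hat_s2 - t2) ** 2
        if loss < best_loss:
            best_ratio, best_loss = ratio, loss
    return best_ratio


# Generate "observed" asymptotes from a known ratio, then recover it.
true_p1, true_p2 = asymptotes(2.0, pi1=0.6, pi2=0.7)
estimate = method4_ratio(true_p1, true_p2, pi1=0.6, pi2=0.7)
print(estimate)
```

Because only two asymptotic proportions enter the criterion, the estimate ignores all sequential information in the data, which is the limitation noted in the text.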

The various estimates of $\theta_A$ and $\theta_B$ are
given in Table A1.








TABLE A1

MAXIMUM LIKELIHOOD ESTIMATES OF $\theta_A$, $\theta_B$, AND $\theta_A/\theta_B$

$\theta_A$ on $S_1$ trials
$\theta_B$ on $S_1$ trials
$\theta_A/\theta_B$ on $S_1$ trials
$\theta_A$ on $S_2$ trials
$\theta_B$ on $S_2$ trials
$\theta_A/\theta_B$ on $S_2$ trials
$\theta_A$
$\theta_B$

The occurrence of learning parameter
estimates greater than one makes interpretation
of Model I very tenuous. It
will be instructive to consider the problem
in more detail. It will be seen from
Equations A1 and A2, on which our estimation
procedures are primarily based,
that $\theta_A$ and $\theta_B$ appear in no equations
which would restrict their values to the
unit interval. It is only in interpretation
and in the derivation of transition matrices
that we have made use of the learning
parameters’ roles as probabilities.
Although we may obtain bounded
estimates of $\theta_A$ and $\theta_B$ by looking at
detailed sequential properties of the data,
we find that it is certain inadequacies of
the learning model at precisely this level
of analysis which are the cause of our
excessively large learning parameter
estimates.

To be more specific let $A_{i,n}$ represent
response $A_i$ on trial $n$, and let $E_{j,n}$ be
defined analogously. Then, it easily
follows from our learning assumptions
that on $S_1$ type trials


P(A, [A5] 


P(A1i 2 4 ay 44 } { \6 ] 


Although it would now seem quite
natural to use Equation A6 as the basis
for a bounded estimate of $\theta_A$, we find that
it is precisely because Equation A5 is not
empirically valid that our previous estimation
procedures have yielded estimates
of $\theta_A$ greater than one. It can be easily
shown that using estimation Method 1,
for example, $\theta_A$ is indeed restrained to the
unit interval if the observed relative
frequency associated with the left side
of Equation A5 is near zero, which in the
present study it unfortunately is not.
In spite of its other difficulties the
present model gives a sufficient description
of important qualitative aspects of
the data. First, $\theta_A$ is consistently greater
than $\theta_B$; second, comparing Group I
(easy, low symmetry) and Group III (hard,
low symmetry), for which $\pi_1$ and $\pi_2$ are the
same, Group I, which involved the easier
discrimination, has a greater learning
parameter ratio $\theta_A/\theta_B$. Although the
pseudo maximum likelihood estimates of
the learning parameters weakly contradict
this latter statement, it is our feeling
that we may place more confidence in
the true maximum likelihood estimates,
since their optimality properties are well
known.

We should expect under this model no
change in the $\theta$ values as a function of $\pi_1$
and $\pi_2$ when the same stimuli are used.
Unfortunately there is such a change;
indeed, we notice that when the subject
makes the less frequently reinforced
response and is reinforced, this reinforcement
tends to have a greater probability
of being effective than it would have
otherwise had. This can be seen by comparing
the $\theta$ estimates made on $S_1$ and $S_2$
type subsequences; the $\theta_B$ estimates in
Group I, however, are contrary to this
generalization.


APPENDIX B 


We consider it to be of some general 
interest to present the method of obtain- 
ing the learning curves 3, 4, 7, and 8, 
since essentially the same technique may 
be used to derive learning curves in an N 
element pattern model (Estes, 1959) 
with a probability distribution over the 
elements. 

Let : 


Event of precisely & samplings } 
of stimulus element 7 in first r 


n trials, m = 1, 2, --- 


P(S,))""-" [B4 


J 


In a general 2 X 2 Markov process if 
x, is the probability of being in state one 
on trial m and 


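But for the transition matrices for $S_1$
and $S_2$ type trials, respectively, we have

$$S_1 \text{ trials:} \quad b = (1-\pi_1)\theta_2, \quad a = \theta_1 + \pi_1\theta_2, \quad a + b = \theta_1 + \theta_2 \qquad \text{[B7]}$$

$$S_2 \text{ trials:} \quad b = \theta_1 + \pi_2\theta_2, \quad a = (1-\pi_2)\theta_2, \quad a + b = \theta_1 + \theta_2$$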


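which substituting in Equation B4 gives

$$x_n = \sum_{\nu=0}^{n-1} \left[\frac{a}{a+b} - \left(\frac{a}{a+b} - x_1\right)(1-\theta_1-\theta_2)^{\nu}\right] \binom{n-1}{\nu} P^{\nu}(S_i)\,[1-P(S_i)]^{n-1-\nu} \qquad \text{[B10]}$$

$$= \frac{a}{a+b} \sum_{\nu=0}^{n-1} \binom{n-1}{\nu} P^{\nu}(S_i)\,[1-P(S_i)]^{n-1-\nu} - \left(\frac{a}{a+b} - x_1\right) \sum_{\nu=0}^{n-1} \binom{n-1}{\nu} \left[(1-\theta_1-\theta_2)P(S_i)\right]^{\nu} [1-P(S_i)]^{n-1-\nu}$$

$$= \frac{a}{a+b} - \left(\frac{a}{a+b} - x_1\right) \left[(1-\theta_1-\theta_2)P(S_i) + 1 - P(S_i)\right]^{n-1}$$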
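
The collapse of the sum over the number of samplings into a single power is an application of the binomial theorem, and it can be checked numerically. The sketch below assumes a two-state chain in which state one is entered with probability $a$ and left with probability $b$ on each relevant trial, so that after $\nu$ such trials the state-one probability is $a/(a+b) - (a/(a+b) - x_1)(1-a-b)^{\nu}$; the function names and parameter values are illustrative assumptions.

```python
from math import comb


def mixed_curve(n, a, b, x1, p_s):
    """State-one probability on trial n, averaged over the binomial
    number v of relevant trials among the first n - 1 trials."""
    x_inf = a / (a + b)
    total = 0.0
    for v in range(n):
        weight = comb(n - 1, v) * p_s ** v * (1.0 - p_s) ** (n - 1 - v)
        total += weight * (x_inf - (x_inf - x1) * (1.0 - a - b) ** v)
    return total


def closed_form(n, a, b, x1, p_s):
    """Same quantity after applying the binomial theorem."""
    x_inf = a / (a + b)
    return x_inf - (x_inf - x1) * (1.0 - (a + b) * p_s) ** (n - 1)


# Illustrative parameter values only.
for n in range(1, 16):
    print(n, mixed_curve(n, 0.3, 0.2, 0.1, 0.4), closed_form(n, 0.3, 0.2, 0.1, 0.4))
```

The two columns agree to within rounding error, which shows that mixing over the number of relevant trials only rescales the rate constant from $a + b$ to $(a + b)P(S_i)$.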







This becomes for S, trials, letting and for S2 trials becomes, letting 


)=6 ao =1—6 


P(A1,n| 51,2) P(Ai,n| S2,n) 


ss 6; oo 1 10 , 6, oa 1105 m= (1—712)02 


o : »_ (1—m2)62 
A:+6, — 6; + 0 A +62 


os nae =” 


6,+ 6)" [B12] X[1— (1—y) +6.) [B13] 





Psychological Review
1961, Vol. 68, No. 1, 60-67


AN ANALYSIS OF GSR CONDITIONING 


M. A. STEWART, J. A. STERN, G. WINOKUR, AND S. FREDMAN

Department of Psychiatry and Neurology, Washington University School of Medicine, St. Louis


Classical conditioning was defined
by a round table conference on nomenclature
in animal behavior as


The process of acquisition by an animal of
the capacity to respond to a given stimulus
with the reflex reaction proper to another
stimulus (the reinforcement) when the two
stimuli are applied concurrently for a number
of times (Thorpe, 1956).


This definition fits the type of experiment
done by Pavlov, in which a
specific form of response, namely salivation,
is conditioned to a neutral
stimulus such as the sound of a
metronome. By “specific form of response”
in this context we mean a
response which is naturally evoked by
only a limited class of stimuli. This
idea is reflected by Pavlov’s own
definitions of the terms:


In the unconditioned reflex, those properties
of the substance to which the saliva is
physiologically adapted act as the stimulus,
for example, the hardness, the dryness, the
definite chemical properties etc.; in the
conditioned reflex, on the other hand, those
properties which bear no direct relation to
the physiological role of the saliva act as
stimuli, for example, color, form and the
like (Pavlov, 1928).


While theoretically no stimulus can
be completely neutral in relation to
any response, there is no practical
problem in finding stimuli which are
neutral in relation to a response such
as salivation. But much of the work
done in classical conditioning, particularly
with human subjects (Ss), has
involved responses which are far from
specific in this sense, for instance,
GSR, eyeblink, and startle responses.
In working with these responses one
finds that almost no stimuli are neutral,
and hence that the reflex action
proper to the conditioned stimulus is
the same as that proper to the unconditioned
stimulus, though varying in
extent. It follows that experimenters
in this field should always be aware of
the possibility that they are producing
“pseudoconditioning,” or an augmenting
of an original response to the conditioned
stimulus rather than true
conditioning.

The problem is reviewed by Hilgard
and Marquis (1940), who say:


the augmentation of an original response
to the conditioned stimulus through a conditioning
procedure may be described as
sensitization. This is merely a form of
pseudo-conditioning, unless it can be demonstrated
that the sensitization is a function
of the repetition of conditioned and unconditioned
stimuli in precise relationship.


We doubt whether it is right to speak
of conditioning rather than sensitization
even if this qualification is met.
The experiments of Bernstein (1934)
and Hunter (1937) showed that in
the case of eyeblink responses in man
and startle responses in rats, conditioned
responses occurred at different
latencies than those of the original
responses to the conditioned stimuli,
and in the case of the rats could
further be distinguished from the original
responses by the fact that this
latency tended to increase with the
number of conditioning trials and decrease
during extinction trials, while
the latency of the original responses
remained constant. Our own experience,
working with the GSR, has been
that true conditioned responses, that is,
responses which are not elicited by the
conditioned stimulus before its being
paired with the unconditioned stimulus,
can be differentiated from other
responses to the conditioned stimulus
on the basis of latency. We have
used this criterion in previous experiments
(Stewart, Winokur, Stern, Guze,
Pfeiffer, & Hornung, 1959; Winokur,
Guze, Stewart, Pfeiffer, Stern, & Hornung,
1959) and further explored its use
in the experiment to be described below.

There is no a priori reason for
thinking that sensitization does not
follow discoverable laws as do habituation
and conditioning. Only if this
were so would it be reasonable to differentiate
sensitization from conditioning
on the grounds that the one
depended on the repetition of the stimuli
in precise relationship and the
other did not. In our opinion, therefore,
the authors who have paid most
attention to this problem previously,
White and Schlosberg (1952), were
not justified in their assertion that conditioning
of the GSR can only be effected
by using a .5-second delay period,
and that the use of other delay periods
results only in pseudoconditioning.
We propose to review representative
previous papers on conditioning the
GSR and show that this work has
probably so far dealt only with sensitization,
and not with true conditioning.


In the paper referred to above, 
White and Schlosberg describe an ex- 
periment in which they compared the 
effects of varying the delay period between
conditioned and unconditioned
stimuli while conditioning the GSR
to light. Groups of Ss were conditioned
with delay periods of 0, .25,
.5, 1, 2, and 4 seconds and a control 
group was given the same stimuli un- 
paired at an interval of 30 seconds, 
which was one-half the intertrial inter- 
val used in the test groups. The authors 
do not precisely define what responses 


were called conditioned responses, but 
say “the primary data consisted of 
measured deflections on each of the 
five extinction trials, and on a final 
shock.” No mention is made of the
time relationships of these deflections 
to the stimuli, even though a reference 
is made to the fact that a response was 
evoked by the light before conditioning 
trials began. The number of condi- 
tioning trials in each group was five, 
and no initial habituation trials were 
carried out. In our experience very 
few true conditioned responses appear 
within the first five trials in this type 
of experiment when normal Ss are 
used, while the previously habituated 
unconditioned response to the con- 
ditioned stimulus reappears at the sec- 
ond conditioning trial. We assume, 
therefore, that the deflections to which 
the authors refer are most probably 
augmented unconditioned responses to 
the conditioned stimulus. The find- 
ings of this investigation were that 
only the group conditioned with the 
.5-second delay period showed signifi- 
cantly larger conditioned responses 
than the control group. It is interest- 
ing to speculate whether even this 
significant difference might not have 
disappeared if the experimenters had 
used a longer intertrial interval than 
30 seconds for the control group. It is
well established, for instance by the
work of Berg and Beebe-Center
(1941), that the adaptation of unconditioned
responses to stimuli is
accelerated by decreasing the interval 
between stimuli, and slowed or stopped 
altogether by lengthening the interval. 

Bitterman and Holtzman (1952),
in their study of GSR conditioning as 
a measure of anxiety, say: 


The initial response to tone disappears to
a great extent during the unreinforced
adaptation trials; during the conditioning
trials the tone response grows in negatively
accelerated fashion, and then it tends to
decline once more during the extinction
series.

The “tone response” is defined later 
in their paper as “the initial deflection 
produced by the tone.” In a study on 
GSR conditioning in psychiatric pa- 
tients, Howe (1958) states: 

It was evident in the vast majority of cases 
that adaptation was then complete, since, at 
the fourth pretesting presentation of the tone, 
the response was either zero or negligible. 


He does not actually define what he 
then regarded as a conditioned response 
to tone, but says that after each stimu- 
lation “minimal and maximal galvano- 
metric readings were taken,” without 
specifying any time relations of the 
responses. Franks (1956) in describing
the experimental technique for
his study of conditioning in normal 
and neurotic Ss, says: 


Before the conditioning session was begun, 
each S was given three tone stimuli, fol- 
lowed by three air puff stimuli (not paired 
with the CS) and then three more tone 
stimuli. Only Ss who did not give PGR 
or eyeblink responses to the last three tone 
stimuli were included in the conditioning 
study. The purposes of these trials were 
to eliminate those Ss who showed any evi- 
dence of pseudo-conditioning or original 
sensitivity to the tone. 


He then later defines an eyelid or 
GSR conditioned response as a deflec- 
tion in the record occurring during a 
latency of between 0.5 and 8.0 seconds 
after the onset of the CS. 

All these authors, then, have left 
the significance of their findings in 
doubt for want of definition of the 
response treated as a conditioned re- 
sponse, and have left ample grounds 
for the suspicion that their findings do 
not relate to conditioning but to sensi- 
tization of the GSR. 

In addition, their methods raise cer- 
tain general problems of GSR con- 
ditioning which invite comment. The 
first is the soundness of excluding Ss
who give more than a few unconditioned
responses to the conditioned
stimulus, as was done in Franks’ study. 
This is a purely arbitrary division of 
Ss which depends to a large extent on 
the sensitivity of the recording ap- 
paratus. The important result of this 
exclusion, however, is the skewing of 
the test group in favor of nonreactive 
Ss, which must invalidate any com- 
parison of clinical subgroups such as 
Franks was interested in. Secondly, 
the work of Landis and Hunt (1935) 
and of Cook and Harris (1937) has 
shown that the threshold for evocation 
of the GSR and the general reactivity 
of an S measured by the GSR, is
markedly affected by uncontrollable
factors such as the S’s feelings and
general factors such as instructions
given to the S at the outset of the
experiment. The implication of this 
is that the general conditions of the 
experiment and in particular the in- 
troduction to the experiment must be 
held constant. Finally, it appears from 
the work of Prosser and Hunter 
(1936) that a conditioned stimulus 
which initially evokes no uncondi- 
tioned response may do so after being 
paired with a painful unconditioned 
stimulus a number of times through an 
alteration of the threshold for evoca- 
tion of the response. These authors 
found that when a sound, which in- 
itially evoked no startle response, had 
been presented to rats together with 
a shock a number of times, the sound 
alone would produce a startle response 
with exactly the same latency as that 
which was evoked by a sound loud 
enough to evoke startle in the first 
place. As we have already mentioned, 
these authors distinguished this type 
of response from a conditioned startle 
response on the basis of its latency and 
the variation of this latency as opposed 
to the constancy of the unconditioned 
response latency. They also noted that
while the unconditioned response was
evoked either immediately or after a
few trials, many trials were necessary
before the conditioned response appeared.
These observations seem especially
relevant to determining the
true status of GSR to tone in any
conditioning experiment.

In order to determine whether gal- 
vanic skin responses could be condi- 
tioned, using the latency of response 
as the chief criterion distinguishing 
conditioned responses from others, we 
carried out the following experiment. 


METHOD 


The Ss were 19 student nurses from the 
Barnes Hospital School of Nursing. A 
uniform introduction to the experiment was 
given to the Ss, put in general terms, and 
not telling them that they would receive 
shock. We made a point of asking Ss not 
to let their fellow students know what went 
on in the experiment. The apparatus con- 
sisted of a Fels Dermohmeter connected to 
an Esterline-Angus chronograph, a series 
of three electronic timers, an Eico audio- 
generator, and a shocking device. The S 
sat in a comfortable chair in a room separate 
from the experimenter. Recording elec- 
trodes in the form of zinc cups filled with 
an electrolyte jelly were placed on the S’s 
hand, one on the palm and one on the 
dorsum. Shock electrodes were placed on 
the S’s ankle, and the headphones through 
which the conditioned stimulus (a tone of 
500 cps frequency and about 65 db. intensity)
was presented, over the S’s ears.
After a 5-minute rest period, the S was 
presented with the conditioned stimulus 
alone for 20 trials (adaptation period), the 
stimulus lasting 6.5 seconds and the intertrial 
interval varying at random between 30 sec- 
onds and 90 seconds. Next, 20 paired trials 
(conditioning period) were given at similar 
intervals, in which the unconditioned stim- 
ulus, a painful shock, was presented for 
.5 second 1.0 second after the end of the tone. 
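
The timing of the procedure can be laid out as a trial schedule. The sketch below is only an illustration: it assumes that the intertrial interval is measured from tone onset to tone onset and is drawn uniformly between 30 and 90 seconds, neither of which is stated in the text, and the function and field names are hypothetical.

```python
import random

TONE_S = 6.5    # duration of the conditioned stimulus (tone)
GAP_S = 1.0     # interval from tone offset to shock onset
SHOCK_S = 0.5   # duration of the unconditioned stimulus (shock)


def make_schedule(n_adapt=20, n_cond=20, rest_s=300.0, seed=0):
    """Return one dict per trial with stimulus times in seconds."""
    rng = random.Random(seed)
    t = rest_s  # first tone follows the 5-minute rest period
    trials = []
    for k in range(n_adapt + n_cond):
        paired = k >= n_adapt  # shock only in the conditioning period
        trials.append({
            "tone_on": t,
            "tone_off": t + TONE_S,
            "shock_on": t + TONE_S + GAP_S if paired else None,
        })
        # Assumed: uniform onset-to-onset interval of 30 to 90 seconds.
        t += rng.uniform(30.0, 90.0)
    return trials


schedule = make_schedule()
print(schedule[0])
print(schedule[20])
```

Under these timings the shock begins 7.5 seconds after tone onset on every conditioning trial.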


RESULTS AND DISCUSSION 


The responses, or deflections, which
appeared on the records could be
separated into four categories. These
were (a) spontaneous deflections occurring
between trials, (b) and (c)
unconditioned responses to the tone 
and to the shock, and (d) conditioned 
responses to the tone, or “conditioned 
anticipatory responses.” In Figure 1 
are shown the mean values and the 
ranges of latencies for responses of 
Categories b, c, and d, from Ss 
taken alternately from a randomly ar- 
ranged list of the group. Only the 
responses from the conditioning period 
are shown, but the characteristics of 
the unconditioned response to tone 
were the same in both periods. In 
particular, the relative constancy of 
the latency of response in any one S
which can be seen in this diagram 
is true of both periods. 

In drawing up our criteria for de- 
fining a “conditioned anticipatory re- 
sponse,” we made use of the constancy 
of the unconditioned response latency. 
These criteria were: (i) the response 
had a latency which fell outside the 
range of latency for the unconditioned 
response to tone as recorded in the 
initial adaptation period, (ii) the re- 
sponse occurred at a time before a 
response to shock was physiologically 
possible, i.e., before a point 1.0 second 
after the onset of the shock, and (iii) 
the height of the deflection was at least 
equal to one of the small divisions on 
the vertical scale of the record paper. 
In practice, Criterion i usually meant 
that the response came at least 4.0 sec. 
after the onset of the tone. 
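
The three criteria can be written as a simple decision rule. In the sketch below latencies are measured from tone onset, so that with a 6.5-second tone and a shock beginning 1.0 second after the tone ends, a response to shock is taken to be possible only from 8.5 seconds onward. The function name and the example values are illustrative assumptions, not scoring data from the study.

```python
TONE_S = 6.5                      # tone duration
GAP_S = 1.0                       # tone offset to shock onset
SHOCK_ONSET_S = TONE_S + GAP_S    # shock begins 7.5 s after tone onset
MIN_SHOCK_LATENCY_S = 1.0         # earliest possible response to shock


def is_conditioned_anticipatory(latency_s, height_divisions, ur_range_s):
    """Apply Criteria i, ii, and iii to one deflection.

    latency_s:        latency of the deflection from tone onset.
    height_divisions: height in small divisions of the record paper.
    ur_range_s:       (min, max) latency of this S's unconditioned
                      response to tone in the adaptation period.
    """
    lo, hi = ur_range_s
    outside_ur_range = latency_s < lo or latency_s > hi                     # Criterion i
    before_shock_response = latency_s < SHOCK_ONSET_S + MIN_SHOCK_LATENCY_S  # Criterion ii
    large_enough = height_divisions >= 1.0                                  # Criterion iii
    return outside_ur_range and before_shock_response and large_enough


# Hypothetical S whose unconditioned response to tone fell between 1.5 and 3.0 s.
print(is_conditioned_anticipatory(5.0, 1.5, (1.5, 3.0)))
print(is_conditioned_anticipatory(2.0, 2.0, (1.5, 3.0)))
```

The rule depends on the individual S's adaptation-period latencies, which is why the relative constancy of the unconditioned response latency matters for scoring.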

Using these criteria, two of the in- 
vestigators (JS and MS) went over 
the records independently, counting the 
responses at each trial. The reliability 
coefficient of their judgment of the 
total number of conditioned responses 
given by each S was .975. Perfect
agreement was present in their judg- 
ments of the response number for 13 
out of the 19 Ss (68%), and agree- 
ment within a margin of one response 
in 17 out of the 19 (90%). 





[Figure 1. Axes: subject number and time in seconds, with “Tone on” and “Shock on” marked. Legend: orienting response, anticipatory response, response to shock, range of response.]

FIG. 1. The means and ranges of the latencies of the unconditioned responses to tone
and to shock, and of the conditioned anticipatory responses, during the conditioning period.
(Data are presented for alternate Ss only. The numbers within the speckled areas refer to
the number of responses on which the mean and range is based. Orienting response =
unconditioned response to tone.)


Figure 2 shows representative samples
of a record from one of the Ss.
The sample showing an example of
the three responses occurring at one
trial illustrates the fact that on occasion
the conditioned response was
part of a complex of deflections rather
than being discrete. This has not
actually made for any great difficulty
in scoring the records, though it made
it impossible to measure the amplitude
of the deflections. It has, however,
underlined the possibility that the deflection
which we have called a conditioned
response is part of the normal
unconditioned response to tone or that
it is a random spontaneous deflection.
Responses corresponding to the con- 
ditioned responses of the second period 
did in fact occur in the initial adapta- 
tion period. Altogether there were 
13 such responses, four Ss making one 
each, two Ss making two, and one S 
making five. These responses ap- 
peared to be randomly distributed over 
the adaptation trials, and there did not 
appear to be any correlation between 
the number of “conditioned responses” 
made in this period and the number 
made in the conditioning period. On 
the other hand, these responses were 
seen in the initial period in those Ss
who had the highest rate of spontaneous
deflections. The number of responses
being small it was not feasible
to treat these observed relationships 
statistically. 

From these observations we tend to 
think that the occurrence of these 
“conditioned responses” in the initial 
period was the result of spontaneous 
GSR activity. Further evidence to 
support the idea that the responses oc- 
curring in the two periods were not 
of the same kind comes from the rates 
of spontaneous deflections in the two 
periods. We were able to count the 
spontaneous deflections in both periods 
in 16 out of the 19 Ss, and found a 
total of 260 in the first period, and 395 
in the second, giving a ratio of 1:1.5.
For the whole group the total of “con- 
ditioned responses” in the first period 
was 13, and in the second 77, giving a 
ratio of 1:5.9. From this comparison 
alone it would seem that the condi- 
tioned responses occurring in the con- 
ditioning period are not the product 
of spontaneous activity. 
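
The comparison rests on two simple ratios, which can be recomputed from the counts given in the text. The short sketch below does only that arithmetic; the function name is an illustrative choice.

```python
def ratio_second_to_first(first_period, second_period):
    """Return x such that the counts stand in the ratio 1 : x."""
    return second_period / first_period


# Spontaneous deflections, counted in 16 of the 19 Ss.
spontaneous = ratio_second_to_first(260, 395)
# Responses meeting the "conditioned response" criteria, whole group.
conditioned = ratio_second_to_first(13, 77)

print(round(spontaneous, 1), round(conditioned, 1))
```

The rise in criterion responses from the adaptation to the conditioning period is thus roughly four times the rise in spontaneous deflections.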

Figure 3 shows the number of re- 
sponses given by the group at each 
trial of the two periods, leaving out 
unconditioned responses to shock.
Gradual adaptation of the unconditioned
response to tone is clearly present
in the curve for the adaptation
period, all of the 19 Ss responding
at the first trial, four at the last
trial, and only one at the first trial 
of the conditioning period. At the 
second and following conditioning 
trials a marked recovery of this re- 
sponse takes place, and relatively little 
extinction is present at the end of the 
conditioning period. 

FIG. 2. Three samples from the record
of a single subject. (The record reads from
right to left, and each small division along
the horizontal axis represents .5 second.)

FIG. 3. The number of unconditioned
responses to tone, and conditioned anticipatory
responses, given by the whole group
at each individual trial.


The curve of the conditioned anticipatory
response approximates an inverted
U shape, with a peak of eight Ss
responding at the eighth trial, and a
decline to the low level of two responding
at the twentieth trial. This shape is
in obvious contrast to the usual learning
curve, but it is also unlike the
adaptation curves for unconditioned
responses to tone. Our working assumption
at present is that the “conditioned
response” curve does in fact
represent the true acquisition of a
conditioned response, and that the extinction
of the response is a function
of massed reinforcement. A crucial
experiment, then, for our hypothesis
is a comparable experiment in which
the intertrial intervals will be considerably
longer, or in which aperiodic
schedules of reinforcement are used.

In summary, we have put forward
reasons for thinking that so far work
on GSR conditioning has dealt with
the adaptation and recovery of unconditioned
responses rather than conditioning
of responses. We have
described criteria for defining true
conditioned GSR in terms of the response
latency, following methods used
by previous investigators working with
eyeblink and startle conditioning. The
use of these criteria is illustrated in an
experiment in which we tried to condition
GSR in a group of 19 normal
Ss. Responses were obtained which
could be distinguished from unconditioned
responses and which had
some of the characteristics of true conditioned
responses.


REFERENCES 


Berg, R. L., & Beebe-Center, J. G. Cardiac startle in man. J. exp. Psychol., 1941, 28, 262-279.

Bernstein, A. L. Temporal factors in the formation of the conditioned eyelid reactions in human subjects. J. gen. Psychol., 1934, 10, 173-196.

Bitterman, M. E., & Holtzman, W. H. Conditioning and extinction of the galvanic skin response as a function of anxiety. J. abnorm. soc. Psychol., 1952, 47, 615-623.

Cook, S., & Harris, R. Verbal conditioning of the GSR. J. exp. Psychol., 1937, 21, 202-210.

Franks, C. M. Conditioning and personality: A study of normal and neurotic subjects. J. abnorm. soc. Psychol., 1956, 52, 143-150.

Hilgard, E. R., & Marquis, D. G. Conditioning and learning. New York: Appleton Century, 1940.

Howe, E. S. GSR conditioning in anxiety states, normals, and chronic functional schizophrenic subjects. J. abnorm. soc. Psychol., 1958, 56, 183-189.

Hunter, W. S. Muscle potentials and conditioning in the rat. J. exp. Psychol., 1937, 21, 611-624.

Landis, C., & Hunt, W. A. The conscious correlates of the galvanic skin response. J. exp. Psychol., 1935, 18, 506-529.

Pavlov, I. P. Lectures on conditioned reflexes. New York: International Univer. Press, 1928.

Prosser, C. L., & Hunter, W. S. The extinction of startle responses and spinal reflexes in the white rat. Amer. J. Physiol., 1936, 117, 609-618.

Stewart, M. A., Winokur, G., Stern, J., Guze, S., Pfeiffer, E., & Hornung, F. Adaptation and conditioning of the GSR in psychiatric patients. J. ment. Sci., 1959, 105, 1102-1111.

Thorpe, W. H. Learning and instinct in animals. Cambridge: Harvard Univer. Press, 1956. P. 72.

White, C. T., & Schlosberg, H. Degree of conditioning of the GSR as a function of the period of delay. J. exp. Psychol., 1952, 43, 357-362.

Winokur, G., Guze, S., Stewart, M., Pfeiffer, E., Stern, J., & Hornung, F. Association of conditionability with degree of reactivity in psychiatric patients. Science, 1959, 129, 1423-1424.


(Received November 17, 1959)





Psychological Review
1961, Vol. 68, No. 1, 68-8


EFFECTS OF SCORE TRANSFORMATIONS IN Q AND R
FACTOR ANALYSIS TECHNIQUES ¹

DONALD M. BROVERMAN
Worcester State Hospital, Massachusetts


Controversy over the relationship of Q factor analysis (correlating
people) to R factor analysis (correlating tests) has existed since
Stephenson's introduction of Q technique in 1936. Stephenson (1936,
1952, 1953) has maintained that Q technique gives different results
than R technique. Burt (1937), Eysenck (1953), and others, on the
other hand, have argued that the two techniques are essentially
transpositions of each other, i.e., factors of visual acuity and
manual dexterity obtained from an R analysis will appear as factors
of visually acute and manually dextrous individuals in a Q analysis
of the same data. Cattell (1952a, 1952b), while agreeing with this
latter position, has pointed out certain differences which make R
preferable to Q technique.

This controversy reached its climax when Stephenson (1952) and
Cattell (1952b) simultaneously published articles stating their
respective positions. In retrospect, it seems that Cattell’s
position has been the more popular since over 95% of the factor
analyses reported in the past seven years have employed the R
technique.

The present paper presents new arguments and empirical evidence of
important differences not previously recognized between the two
techniques as they are commonly employed. The paper will try to
show that the controversy over the Q and R techniques may be
clarified by an examination of the effects of standardizing scores
across the direction of correlation, i.e., standardizing columns
when correlating rows, or vice versa. It will be argued that such
procedures, which are customary in Q analysis, change the
significance of scores, which, in turn, alters the inferences that
may be drawn from the data.

¹ This paper was supported by the Dementia Praecox Research
Project, Worcester State Hospital, and a research grant (M-896),
from the National Institute of Mental Health, United States Public
Health Service. The writer prepared this paper while a Public
Health Postdoctoral Fellow in Clinical Psychology. Thanks are due
to Melvin Feffer, Morton Wiener, and Jeffrey Caine for their
invaluable comments during the preparation of this paper.


STANDARDIZING ACROSS THE
DIRECTION OF CORRELATION


Standardizing Columns When Correlating Rows

Cattell (1952a) has extensively discussed the effects of
standardization of data on factor solutions. His general position
is that standardization results in some loss of information but
preserves the remaining data relationships. Thus, Cattell pointed
out that if body measurements, e.g., weights, arm lengths, and
finger lengths of individuals are compared, the “profiles” for all
subjects (Ss) will be similar since weight, measured in pounds,
will be largest for everyone, while finger length, measured in
inches, will be smallest. Accordingly,

    All people will correlate highly together in what we may call a
    “common species” factor measuring various individual
    resemblance to common man. This factor is irrelevant for most
    differentiation within the species homo sapiens, and only after
    its extraction shall we come to the factors of individual
    differences in which we are interested (Cattell, 1952a, p. 96).

Transformation of the above body 
measurements into standard scores, 
according to Cattell, eliminates the 
“species” factor, since weight and 
finger length now have distributions 
with similar means and ranges. Ac- 
cordingly, the individual differences 
factor, which appeared second in the 
Q analysis of the raw data, should 
now appear first. The only effect of 
standardizing columns of measure- 
ments when correlating rows of people, 
according to Cattell, then, is the loss 
of the “species” factor. 
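The transformation into standard scores can be sketched in a few lines. The example below is only an illustration: the weights and finger lengths are hypothetical values, not data from the paper, and the use of the population sigma and of a target mean of 50 and sigma of 10 is an assumption made here for concreteness.

```python
from statistics import mean, pstdev


def t_scores(column, new_mean=50.0, new_sigma=10.0):
    """Linearly rescale one column of measurements to a chosen mean and sigma."""
    m = mean(column)
    s = pstdev(column)  # population sigma (an assumption of this sketch)
    return [new_mean + new_sigma * (x - m) / s for x in column]


# Hypothetical raw measurements for five Ss (not taken from the paper).
weight_lb = [150.0, 165.0, 180.0, 195.0, 210.0]
finger_in = [2.8, 3.0, 3.1, 3.3, 3.4]

weight_t = t_scores(weight_lb)
finger_t = t_scores(finger_in)

# After rescaling, both columns share the same mean and spread, so neither
# measurement is "largest for everyone" merely because of its units.
print([round(x, 1) for x in weight_t])
print([round(x, 1) for x in finger_t])
```

Once both columns are on the same scale, the ordering of scores within a person's profile no longer reflects the units of measurement, which is the condition under which the “species” factor is said to disappear.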

Cattell’s analysis of the effects of 
standardization on factor solutions is 
only partially correct. Besides losing 
the information pointed out by Cattell, 
standardizing columns of scores when 
correlating rows of people may bring 
out relationships between the rows 
which are not observable in the raw data. The reason for this is as
follows:

When rows of raw data, such as body measurements, for two Ss are
correlated in a Q analysis, each S’s 
“profile” or distribution of scores is 
treated as a separate unit unconnected 
with the data of other Ss. The cor- 
relation between the two Ss, then, is 
based exclusively on the properties of 
each of these two distributions. How- 
ever, when the columns of scores are 
standardized, each score in the person’s 
profile is altered by a consideration, 
i.e., its relationship to the measure- 
ments of the other Ss in that distribu- 
tion, which did not enter into the 
above correlation. Thus, a score that 
was previously small in the raw data, 
such as finger size, may remain small 
in the standardized distribution if the 
finger is small among a population of 
fingers, or it may become a large score 
if it is a large finger in that population. These vagaries, which
reflect the individual differences between Ss, alter the
relationships between rows.

Cattell’s suggestion that this infor- 
mation about individual differences 
will be contained in the residuals of 
the raw score correlations after ex- 
traction of the “species” factor is in- 
correct, since this information, as in- 
dicated above, does not enter into the 
original correlations and the residuals 
contain only that part of the original 
correlations not accounted for by the 
first factor. Residual correlations can- 
not contain information not included 
in the original correlations. 
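The argument about residuals can be stated compactly. In the notation below, which is assumed here for illustration and is not the paper's own, r_pq is the original correlation between Ss p and q, and a_p is the loading of S p on the first factor.

```latex
% Residual correlation between Ss p and q after the first factor is removed
\tilde{r}_{pq} = r_{pq} - a_{p}\,a_{q}
```

Every term on the right-hand side is computed from the original correlation matrix, so the residual can carry no information that the original correlations lacked. As a check against printed values, the Table 4 entries for S1 and S2 give .82 - (.848)(.872) = .08, which is the residual shown below the diagonal of that table.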

The above argument is illustrated 
by the example suggested by Cattell, 
i.e., a table of body measurements. 
Table 1 presents imaginary raw data
for 12 Ss on 6 body measurements:
height, arm length, foot size, neck
size, waist size, and body weight. A Q analysis
of these data,² as shown in Table 2,
results in high positive intercorrela- 
tions of all Ss, which, in turn, pro- 
duce the massive “species” factor, pre- 
dicted by Cattell. This factor is so 
massive that it includes over 99% of 
the total possible communality. The 
residual variance, which, according to 
Cattell, contains the individual dif- 
ferences information, would normally 
be considered too small to warrant 
further consideration. However, Table 
2 also shows a minute second factor 
which was extracted from the remain- 
ing variance in order that comparisons 
might be made with the first Q factor 

² Tryon’s Cumulative Cluster Analysis
(1958) was employed throughout this paper. 
This technique was used as a matter of 
convenience since it proceeds directly to an 
approximate simple structure factor solu- 
tion similar to that obtained after rotation 
of axes in the more common centroid 
methods of analysis (Cattell, 1952a). How- 
ever, while the factors discussed in this 
paper are all approximate simple structure 
solutions, the arguments apply to all factor 
solutions whether or not they are rotated 
to simple structure. 











TABLE 1

BODY MEASUREMENTS, RAW DATA

       Height   Arm Length   Foot Size     Neck Size   Waist Size   Weight
       (feet)    (inches)    (inches)      (inches)     (inches)   (pounds)

S1       6.3       36.0        12.0          17.0         36.5       195.0
S2       5.6       33.0         9.0          13.5         33.0       150.0
S3       5.9       35.0        11.0          14.5         32.0       155.0
S4       5.7       34.5        11.5          17.5         39.0       225.0
S5       5.1       31.5     [illegible]      15.0         34.5       180.0
S6       5.3       32.5         8.5          18.0         37.0       215.0
S7       5.4       32.0        10.0          16.5         38.0       210.0
S8       6.2       35.5        13.0          15.8         33.5       160.0
S9       5.8       34.0         9.5          14.0         34.0       165.0
S10      5.2       31.0         8.0          16.3         35.5       200.0
S11      5.5       33.5        10.5          16.0         36.0       205.0
S12      6.0       37.0        12.5          15.5         35.0       175.0
obtained after the columns of scores are standardized.

Table 3 presents the same body measurements with each column
standardized to a mean of 50 and a sigma of 10. Table 4 shows the
first Q factor obtained from this data, which, according to
Cattell, is the equivalent of the small individual differences
factor obtained from the raw data. However, unlike that diminutive
factor, the individual differences factor obtained from the
standardized scores is almost as massive (it contains 80% of the
total possible communality) as the first “species” factor obtained
from the raw data. Examination of the factor obtained from the
standardized scores reveals that it is defined equally by
positively and negatively loaded Ss. If one ignores the absolute
size of the second factor extracted from the raw data and simply
examines the relative weights of its loadings, it is seen to be
defined predominantly by positively loaded Ss. The average of the
two negative loadings is only 6% of the average of the three
largest positive loadings. Thus, standardization of scores not only
brings out individual differences with much greater strength, but
also presents a different picture of the relationships between Ss
than does a Q analysis from raw data. Clearly the Q factor obtained
after standardizing columns of scores is a different factor than
that obtained or obtainable from a Q analysis of the raw data.


TABLE 2

INTERCORRELATIONS OF ROWS AND Q FACTOR LOADINGS OF BODY MEASUREMENTS,
RAW DATA

                              Intercorrelations*                                  Factor Loadings
        S1     S2     S3     S4     S5     S6     S7     S8     S9    S10    S11    S12      h1     h2

S1            9990   9992   9997   9998   9996   9996   9995   9996   9997   9999   9997    9995   0319
S2     0002          9998   9977   9991   9977   9979   9996   9998   9981   9983   9996    9997  -0020
S3     0003   0001          9980   9990   9978   9979   9999   9998   9981   9985   9999    9999  -0059
S4     0018   0004   0004          9995   9999   9999   9984   9986   9999   9999   9987    9984   0558
S5     0011   0003   0002   0020          9996   9996   9991   9996   9998   9997   9993    9992   0379
S6     0019  -0002   0004   0033   0023          9999   9981   9985   9999   9998   9984    9982   0595
S7     0018   0001   0004   0032   0021   0034          9984   9987   9999   9999   9986    9983   0569
S8     0000   0001   0000   0000   0000   0000   0000          9998   9989   9989   9999    9999   0001
S9     0002   0003   0000   0004   0006   0005   0006  -0001          9988   9990   9998    9998   0096
S10    0018   0001   0003   0031   0022   0034   0032   0000   0006          9999   9987    9984   0569
S11    0015   0003   0003   0027   0017   0028   0027   0000   0003   0026          9991    9989   0468
S12    0002   0001   0000   0003   0002   0003   0003   0000   0000   0003   0003           9999   0053

Total Communality of h1 (h1²) = 11.980
Total Possible Communality = 12.000

* Decimals omitted. Values above the diagonal are the original
correlations. Values below the diagonal are residual correlations
after extraction of the first factor.


TABLE 3

BODY MEASUREMENTS, COLUMNS STANDARDIZED TO A MEAN OF 50
AND A SIGMA OF 10

       Height    Arm    Foot    Neck    Waist    Weight

S1       67       62     58      58      55        51
S2       49       47     45      33      38        33
S3       55       55     53      42      33        38
S4       51       53     55      62      67        67
S5       33       38     33      45      47        49
S6       42       45     42      67      58        62
S7       45       42     49      55      62        58
S8       62       58     67      49      42        42
S9       53       51     47      38      45        45
S10      38       33     38      53      51        53
S11      47       49     51      51      53        55
S12      58       67     62      47      49        47
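The contrast between raw-score and standardized-score Q correlations can be checked directly for a few Ss. The sketch below uses the rows for S1, S2, and S4 as read from Tables 1 and 3; some digits in the scanned tables are hard to read, so the transcribed values are an assumption of this example, and the helper `pearson` is a name introduced here for illustration.

```python
from statistics import mean


def pearson(x, y):
    """Product-moment correlation between two equally long score profiles."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


# Profiles in the order height, arm, foot, neck, waist, weight.
# Raw rows as read from Table 1 (transcription assumed correct).
raw = {
    "S1": [6.3, 36.0, 12.0, 17.0, 36.5, 195.0],
    "S2": [5.6, 33.0, 9.0, 13.5, 33.0, 150.0],
    "S4": [5.7, 34.5, 11.5, 17.5, 39.0, 225.0],
}
# Column-standardized rows as read from Table 3 (transcription assumed correct).
standardized = {
    "S1": [67, 62, 58, 58, 55, 51],
    "S2": [49, 47, 45, 33, 38, 33],
    "S4": [51, 53, 55, 62, 67, 67],
}

for a, b in [("S1", "S2"), ("S1", "S4")]:
    r_raw = pearson(raw[a], raw[b])
    r_std = pearson(standardized[a], standardized[b])
    print(a, b, "raw:", round(r_raw, 4), "standardized:", round(r_std, 2))
```

With the raw rows every pair correlates above .99, because weight dominates each profile. With the standardized rows the same pairs correlate about .82 (S1 with S2) and about -.90 (S1 with S4), in agreement with the corresponding entries of Table 4.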


Standardizing Rows When Correlating Columns

The above argument implies that standardizing rows, i.e.,
standardizing each individual’s distribution of scores, will alter
the column correlations employed in an R analysis. This is
illustrated below:

An R analysis of the raw data in Table 1 produces the table of
intercorrelations and the two R factors shown in Table 5. The first
of these factors is defined by height, arm length, and foot size;
the other by weight, waist size, and neck size. An R analysis of
the standardized columns presented in Table 3 would produce
essentially the same factors, since standardization of columns
alters the column intercorrelations only to the extent that the
column distributions are skewed. Standardizing rows, however,
radically alters the R factor solution of this data, since this
transformation cuts across the line of correlation. The
standardized columns of data in Table 3 are shown restandardized
along each row in Table 6.³ An R analysis of this matrix produces a
single factor, shown in Table 7, which is defined positively by
height, arm length, and foot size; and negatively by weight, waist,
and neck size. Thus, whereas the R analysis of raw scores produced
two independent factors, standardization across rows enables these
data to be characterized by a single R factor. Once more, then, we
see that standardization across the line of correlation changes the
factor solution.


TABLE 4

INTERCORRELATIONS OF ROWS AND Q FACTOR LOADINGS OF STANDARDIZED
COLUMNS OF BODY MEASUREMENTS

                            Intercorrelations*                              Factor Loading
        S1    S2    S3    S4    S5    S6    S7    S8    S9   S10   S11   S12       h1

S1            82    80   -90   -80   -68   -81    72    63   -74   -99    63       848
S2      08          85   -90   -90   -99   -83    83    88   -95   -82    89       872
S3      04   -02         -98   -92   -83   -98    93    66   -92   -83    87       996
S4     -07   -04    00          94    87    96   -92   -70    92    92   -86      -98?
S5      03   -11   -01    05          93    83   -98   -66    88    82   -82      -909
S6      03   -26    00    05    17          78   -86   -87    94    68   -89      -838
S7      02    03    00    00    06    04         -83   -69    92    85   -88      -982
S8      05    04    03   -03   -16    10    06          56   -85   -75    82       905
S9      04    28   -03    02    03    29    01   -07         -81   -59    74       693
S10     05   -14    01    00    03    16    00   -01   -16          76   -98      -932
S11    -25   -05    04    06    02   -06   -01    04    02   -06         -66      -878
S12    -12    12   -01    01   -02   -15   -01    02    13   -16    11             882

* Decimals omitted. Values above the diagonal are the original
correlations. Values below the diagonal are the residual
correlations after extraction of the first factor.


TABLE 5

COLUMN INTERCORRELATIONS AND R FACTOR LOADINGS OF RAW DATA

                             Intercorrelations*                        Factor Loadings
              Height   Arm Length   Foot Size   Neck Size   Waist   Weight      h1     h2

Height                     92           89         -05       -20     -34       953   -085
Arm Length      01                      90         -03       -17     -30       960   -005
Foot Size       00         00                       14        02     -12       939    139
Neck Size      -07        -05           12                    83      88       021    910
Waist          -08        -05           14          83                95      -123    931
Weight         -09        -04           13          89        92              -267    974

* Decimals omitted. Values above the diagonal are the original
correlations. Values below the diagonal are the residual
correlations after extraction of the first factor.


TABLE 6

BODY MEASUREMENTS STANDARDIZED FIRST BY COLUMNS, THEN BY ROWS

             Height   Arm Length   Foot Size   Neck Size   Waist   Weight   Means of Rows

S1            64         57          50          50         43      36         50.0
S2            64         57          52          39.5       48      39.5       50.0
S3            60.5       60.5        52          48         36      43         50.0
S4            36         43          48          52         60.5    60.5       50.0
S5            39.5       48          39.5        52         57      64         50.0
S6            39.5       48          39.5        64         52      57         50.0
S7            43         36          48          52         64      57         50.0
S8            57         52          64          48         39.5    39.5       50.0
S9            64         57          52          36         45.5    45.5       50.0
S10           45.5       36          45.5        60.5       52      60.5       50.0
S11           36         43          50          50         57      64         50.0
S12           52         64          57          39.5       48      39.5       50.0

Means of
Columns       50.1       50.1        49.8        49.3       50.2    50.5
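The effect of row standardization on a column correlation can be illustrated with a single pair of columns. The sketch below correlates height with weight twice: once with the column-standardized scores of Table 3 and once with the doubly standardized scores of Table 6. The values are transcribed from the scanned tables, where some digits are hard to read, so they are an assumption of this example, as is the helper name `pearson`.

```python
from statistics import mean


def pearson(x, y):
    """Product-moment correlation between two equally long columns."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


# Height and weight for S1..S12, columns standardized (as read from Table 3).
height_cols = [67, 49, 55, 51, 33, 42, 45, 62, 53, 38, 47, 58]
weight_cols = [51, 33, 38, 67, 49, 62, 58, 42, 45, 53, 55, 47]

# The same two columns after each row is restandardized (as read from Table 6).
height_rows = [64, 64, 60.5, 36, 39.5, 39.5, 43, 57, 64, 45.5, 36, 52]
weight_rows = [36, 39.5, 43, 60.5, 64, 57, 57, 39.5, 45.5, 60.5, 64, 39.5]

r_single = pearson(height_cols, weight_cols)
r_double = pearson(height_rows, weight_rows)

print("columns standardized only:", round(r_single, 2))
print("columns, then rows:       ", round(r_double, 2))
```

Under these assumptions the height-weight correlation is modest (roughly -.30) when only the columns are standardized and becomes about -.90 once the rows are restandardized, which matches the height-weight entry of Table 7 and shows why the two independent R factors collapse into a single bipolar factor.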


DOUBLE VERSUS SINGLE CENTERED MATRICES

It is generally agreed, since Burt’s mathematical demonstration
(1937), that a doubly centered matrix, i.e., a matrix in which the
columns and rows all have the same mean, will produce transposable
factors. As indicated by the marginal means in Table 6, a matrix of
data which has been standardized first by columns and then by rows
approximates a double centered matrix. A Q analysis of these data,
then, should produce a factor which is the transposition of the
single R factor already obtained from this matrix. However, it is
unnecessary to perform this analysis since the Q analysis of the
standardized columns of Table 3 constitutes, in effect, a Q
analysis of a doubly centered matrix. This is because the
correlational process treats the distributions to be correlated as
though they had the same means and sigmas. Since the columns in
Table 3 were already standardized, correlation of the rows
completed the process required to establish a doubly centered
matrix. That this is so is indicated by Figure 1, where height, arm
length, and foot size are seen to occupy higher positions in the
distributions of measurements of Ss loading positively on the Q
factor obtained from Table 3 than weight, waist, and neck size
measurements; with the reverse true of the negatively loaded Ss.
This factor (height, arm length, and foot size, versus weight,
waist, and neck size), obviously, is the same factor as the one
obtained from the R analysis of the doubly standardized matrix
(Table 7), but expressed in terms of people. Thus, while a Q
analysis of standardized columns is equivalent to an R analysis of
a doubly centered matrix, both differ from an R analysis of either
raw data or standardized columns.


TABLE 7

COLUMN INTERCORRELATIONS AND R FACTOR LOADINGS OF
DOUBLY STANDARDIZED DATA

                             Intercorrelations*                        Factor Loadings
              Height   Arm Length   Foot Size   Neck Size   Waist   Weight       h1

Height                     72           56         -64       -81     -90        -945
Arm Length     -07                      46         -67       -75     -81        -832
Foot Size      -07        -09                      -62       -56     -70        -664
Neck Size      -09        -18          -23                    35      61         584
Waist           01        -03           01         -16                79         865
Weight         -02        -04          -08          07       -01                 930

* Decimals omitted. Values above the diagonal are the original
correlations. Values below the diagonal are the residual
correlations after extraction of the first factor.


³ Standardization of the rows of raw data leads to a matrix which
cannot be analyzed. The raw score profiles of Ss are so similar
that all rows would have virtually identical standardized
distributions. This, in turn, creates columns of scores with little
or no variation to correlate; e.g., since weight is the largest raw
score for all Ss, the entire weight column would have the same
standard score.
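The statement that the correlational process treats the correlated distributions as though they had the same means and sigmas can be written out. The notation is assumed here for illustration and is not the paper's own: x_pj is the score of S p on measurement j, n is the number of measurements, and the bar and s denote the mean and (population) sigma of a row.

```latex
% Q correlation between Ss p and q, written as a mean product of
% row-standardized scores
r_{pq} = \frac{1}{n} \sum_{j=1}^{n}
    \left( \frac{x_{pj} - \bar{x}_{p}}{s_{p}} \right)
    \left( \frac{x_{qj} - \bar{x}_{q}}{s_{q}} \right)
```

Each factor in the sum is a row-standardized score, so computing a Q correlation amounts to standardizing the two rows first. When the columns have already been standardized, as in Table 3, correlating rows therefore supplies the second centering that a doubly centered matrix requires.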

Thomson (1939) has argued that Q correlations of raw scores
involving different units are statistically meaningless since such
scores do not form a metric continuum. Since that time, it has been
standard practice in Q analyses (Babington-Smith, 1950; Eysenck,
1953) to standardize columns, thus producing an equivalence of
units, before correlating rows. As illustrated above, these
procedures, in effect, create a doubly centered matrix. Hence, the
usual Q analysis implicitly involves a doubly centered matrix,
whereas the typical R analysis does not. In view of the differences
in results obtained from single versus double centered matrices
demonstrated above, it seems meaningful to conclude, then, that R
and Q analysis techniques, as commonly performed, are different and
give dissimilar results.


FIG. 1. Means of positively and negatively loaded subjects
calculated from the standardized profile scores presented in
Table 6. (Horizontal axis: Height, Arm, Foot, Neck, Waist, Weight.)


SIGNIFICANCE OF THE DIFFERENCE
IN RESULTS

The above discussion suggests that the differences in results
obtained from the different factor analysis techniques are a
function of the different scores involved in single versus double
centered matrices rather than the direction in which correlation
proceeds. Evaluation of the differences in results, then, requires
a careful consideration of these scores.


Normative versus Ipsative Scores

The usual R analysis of a single centered matrix involves
“normative” scores (Cattell, 1944), or scores which reflect the
standing of each S in each distribution with respect to other Ss.
These scores express interindividual differences in terms of group
norms. Hence, factors obtained from normative scores describe
patterns of interindividual relationships occurring within the
group. For instance, a G factor implies that some Ss are generally
bright and do well on all tasks compared to other duller, poorer
performing Ss. Similarly, less general, specific factors refer to
interindividual differences in abilities which vary independently
of a G factor. R factors obtained from normative scores, then,
reflect patterns of consistency within a group or population of Ss.
Hence, they might be appropriately termed “normative group”
factors.

The usual Q analysis or R analysis 
of a doubly centered matrix, on the 
other hand, first standardizes columns 
of scores about a group norm, and 
then restandardizes each row of scores 
about a norm derived from the in- 
dividual. Thus, these doubly stand- 
ardized scores, termed “ipsative” by 
Cattell (1944), indicate whether the 
performance of an individual on a 
particular task is above or below his 
mean level of performance for the 
entire battery.⁴ Ipsative scores, then,
are scaled about intra-individual, rather 
than group, norms, and reflect intra- 
rather than interindividual variations 
in ability levels. 
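The idea of scaling scores about an intra-individual norm can be sketched as follows. This is a simplified illustration, not the paper's procedure: it performs only the centering step (deviation from the person's own mean) and omits rescaling each row to a common sigma. The two profiles are the column-standardized rows for S1 and S4 as read from Table 3, and the function name `ipsatize` is introduced here.

```python
from statistics import mean


def ipsatize(profile):
    """Express each score as a deviation from the individual's own mean."""
    own_mean = mean(profile)
    return [score - own_mean for score in profile]


# Column-standardized profiles (height, arm, foot, neck, waist, weight),
# as read from Table 3; the transcription is assumed correct.
s1 = [67, 62, 58, 58, 55, 51]
s4 = [51, 53, 55, 62, 67, 67]

s1_ipsative = ipsatize(s1)
s4_ipsative = ipsatize(s4)

# Positive values mark measurements on which the S stands above his own
# mean level; negative values mark those below it.
print(s1_ipsative)
print([round(v, 2) for v in s4_ipsative])
```

For S1 the height, arm, and foot scores lie above his own mean and the waist and weight scores below it, while S4 shows the reverse pattern, even though both Ss have similar overall mean levels.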

Factors derived from doubly standardized ipsative scores (in both R
and Q techniques) also involve comparisons of Ss. However, in this
case, the comparisons are of the patterns of within-individual
variation found in different Ss. These factors, then, characterize
the consistencies in intraperson organization which occur in the Ss
composing the sample. Hence, such factors might be appropriately
termed “normative personological” factors.

It appears, then, that the two techniques produce factors which
refer to different strata of behavior. Normative group factors
describe consistencies in the composition of a group. Such
characterizations may be useful in developing a taxonomy of group
behavior or in establishing the parameters along which
relationships between performances of individuals vary. Normative
personological factors, on the other hand, describe common patterns
of intrapsychic variations in ability. Isolation of these factors
is an appropriate goal for inquiries aimed at establishing the
parameters of intrapsychic functioning.

⁴ It is also possible to develop ipsative scores from the responses
of a single individual, e.g., galvanic skin responses under various
conditions may be assessed relative to a base level characteristic
of the individual. A matrix of such scores, however, would be as
uncentered as a matrix of raw performance scores. Hence,
double-standardization is necessary, regardless of whether behavior
is assessed normatively or ipsatively initially, before similar
results can be obtained in R and Q analysis techniques.

In view of this, it is interesting to 
note that it is current practice in psy- 
chological research to make inferences 
concerning the organization of intra- 
psychic functioning from normative 
group factors (Burt, 1946; Thurstone, 
1938). According to the arguments 
advanced in this paper, this practice 
is unwarranted and should lead to a 
considerably different theory of hu- 
man functioning than would inferences 
drawn from normative personological 
factors. By the same token, experi- 
mental hypotheses may fail to receive 
empirical support if the data are not 
expressed in scores appropriate to the 
hypotheses. 


An R Analysis of Cognitive Data Expressed in Ipsative Scores

The theoretical and empirical con- 
sequences of using different types of 
scores may be illustrated by the 
writer’s experience with actual data 
from a battery of tests administered 
to 35 adult male members of a local 
church. These data were initially ap- 
proached with hypotheses derived from 
the writer’s previous work with two 
cognitive styles (Broverman, 1960a; 
1960b). This work had indicated that 
individuals have stylistic tendencies to 
develop specialized abilities in either 
(a) novel or difficult conceptual be- 
haviors, or (b) novel or difficult 
perceptual-motor behaviors (“Conceptual
versus Perceptual-motor Dominance”).
Accordingly, it was hypothesized
that an R analysis of the test
battery, which included several of each 
of the above mentioned classes of 
tasks, should produce a factor defined 
by inversely related difficult concep- 
tual and perceptual-motor tasks. 

The previous work had also shown 
that individuals vary, independently 
of the above style, in their ability to 
perform simple highly practiced tasks 
regardless of whether they are con- 
ceptual or perceptual-motor in nature 
(“Strong versus Weak Automatiza- 
tion”). Thus, a second factor defined 
by simple, automatized conceptual and 
perceptual-motor tasks was also pre- 
dicted. However, the raw score R 
analysis of these data, reported by 
Podell and Phillips (1959), produced 
four independent R factors, none of
which particularly resembled the pre- 
dicted factors and so failed to support 
the above hypotheses. 

Reconsidering the problem, it was noted that Ss had been
consistently used as their own controls in the previous work on
cognitive styles. Hence, the differences observed between variously
classified Ss were actually based on intergroup comparisons of
intra-individual variations in ability. This observation suggested
that the factor hypotheses should be reformulated in intrapsychic
or ipsative terms. Thus, it was hypothesized that the previously
described factors would be obtained from an R analysis of ipsative
test scores. Table 8 shows the factors obtained by this analysis.
The first ipsative factor is defined positively by Thurstone’s PMA
Verbal Meaning Subtest, Nonsense Fluency Production, and Nonsense
Fluency Order,⁵ all of which may be reasonably termed novel or
difficult conceptual behaviors; and negatively by a ratio measure
derived from Thurstone’s Two Hand Coordinator, Accuracy of
Toothpick Design Construction, and Thurstone’s PMA Space Subtest.
These latter tests are clearly novel or difficult perceptual-motor
tasks. Thus, this factor corresponds closely to the first predicted
factor.

The second ipsative factor is defined positively by speed of doing
simple subtractions, speed of naming color hues, speed of reading
the Stroop Word-Color Interference Card, speed of naming familiar
objects, speed of doing simple additions, tapping speed, and speed
of reading color names (see Fn. 5). Of these tests only the
Word-Color Interference Test does not, on the face of it, seem
classifiable as a simple, automatized, conceptual or
perceptual-motor task. However, the writer has previously
interpreted performance on this card (Broverman, 1960b) as a
function of the ability to make perceptual discriminations which
should be a highly practiced, overlearned ability. The positively
loaded tests defining this factor, then, support the hypothesis
concerning automatized behaviors. Thus, while this analysis also
delineated previously unsuspected classes of behavior, i.e., the
negatively defining tests of the second factor and a small third
factor, the results, in general, were in close agreement with the
notions developed from the work with cognitive styles, whereas the
results from the R analysis of raw scores were not. Clearly, in
this case, a successful outcome depended upon the use of
appropriate scores.


TABLE 8

TEST VARIABLES AND AUGMENTED FACTOR LOADINGS CALCULATED FROM
IPSATIVE SCORES

Tests                                                                              h1     h2     h3

Nonsense Fluency, Number Correct                                                  994   -100    000
PMA, Verbal Meaning Subtest                                                       981   -091    168
Nonsense Fluency, Order                                                           972   -220   -078
Word Construction, Speed                                                          748    226   -622
PMA, Reasoning Subtest                                                            665   -661   -347
Stroop, Reading Color Names                                                       635    637    435
Thurstone Gottschaldt                                                             565   -650   -507
Goodenough Draw-a-Man                                                             426   -642   -636
Porteus Mazes, Number Wrong Alleys                                                091   -953   -287
Subtractions                                                                      000    994   -102
Witkin Gottschaldt                                                                000   -992   -119
Toothpick Design Construction, Number Produced Correctly                          000   -635   -771
Card Sorting, Alphabetical Card Sorting                                           000    509    860
Stroop, Naming Color Hues                                                        -078    950   -299
Stroop, Word-Color Interference                                                  -203    922    328
Tapping Speed                                                                    -332    751   -220
Object Naming Speed                                                              -402    915    000
Additions                                                                        -423    895    136
Card Sorting Ratio, Alphabetical/Nonalphabetical                                 -105   -164   -980
15 Clues, Number for Correct Solution                                            -379   -724   -575
Thurstone Two Hand Coordinator, Sum Both Hands Simultaneously                    -382   -237    892
Wechsler Blocks                                                                  -302   -949    083
PMA, Space Subtest                                                               -741    670    000
Toothpick Designs, % Correct                                                     -924   -274   -264
Thurstone Two Hand Coordinator, Ratio Both Simultaneously/Sum Each Separately    -938   -319    133

Note. Decimals omitted.


⁵ Fuller descriptions of these tests may be found in Podell and
Phillips (1959) and Brooks and Phillips (1958).


Singly versus Doubly Defined Factors 


The normative personological fac- 
tors described above differ from the 
normative group factors of ability 
usually reported in the literature in 
another important aspect. Group fac- 
tors of ability are typically defined 
only by positively loaded tests. The 
normative personological factors shown 
in Table 8, on the other hand, are 
defined with approximately equal 
strength by both negatively and posi- 
tively loading tests. This difference 
in results will be discussed on two 
levels: (a) the numerical mechanics 
underlying the difference and (b) its 
implications for psychological theory. 

The reason normative group ability 
factors are seldom defined both posi- 
tively and negatively is as follows: a 
given group of Ss will usually contain 
wide individual differences in general
levels of ability. In this case, the
individual mean levels of performance
are distributed above and below the
group mean. In such a sample, it is
possible for the performances of a 
bright S on Tests A and B to be 
negatively related intrapsychically, and 
yet both performances may still be 
well above the group mean. Similarly, a 
dull S with the same inverse intrapsy- 
chic pattern of relationships between 
Tests A and B might give performances 
on both tests considerably below the 
group mean. Thus, the performances 
of the bright and dull Ss contribute 
to a positive rather than a negative 
group relationship between these tests. 
Only if the mean level of performance 
of each S happens to correspond with 
the mean level of the group perform- 
ance (in which case all Ss would have 
the same general level of ability) would 
the inverse intra-individual relationships
of Tests A and B occur as an
inverse relationship for the group.
Thus the presence of a G factor within 
a group acts to prevent the occurrence 
of both positively and negatively de- 
fined normative group factors. 
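The numerical mechanics of this argument can be sketched with invented scores. Everything in the example below is hypothetical: six Ss differ widely in general level, Tests A and B are built to move in opposite directions within each S, and a third Test C is added so that the ipsative correlation is not forced by having only two tests. The ipsative scores here are simple deviations from each S's own mean, a simplification of the double standardization described in the paper.

```python
from statistics import mean


def pearson(x, y):
    """Product-moment correlation between two equally long columns."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


# Hypothetical data (not from the paper): general level of each S, an
# offset that raises Test A and lowers Test B within the S, and a small
# unrelated offset for Test C.
g_level = [30, 40, 50, 60, 70, 80]
ab_offset = [4, -4, 4, -4, 4, -4]
c_offset = [1, 2, -3, -1, 3, -2]

test_a = [g + d for g, d in zip(g_level, ab_offset)]
test_b = [g - d for g, d in zip(g_level, ab_offset)]
test_c = [g + e for g, e in zip(g_level, c_offset)]

# Normative correlation: scores compared across Ss as they stand.
r_normative = pearson(test_a, test_b)

# Ipsative correlation: each score is first expressed as a deviation
# from that S's own mean over the three tests.
own_means = [mean(scores) for scores in zip(test_a, test_b, test_c)]
a_ips = [a - m for a, m in zip(test_a, own_means)]
b_ips = [b - m for b, m in zip(test_b, own_means)]
r_ipsative = pearson(a_ips, b_ips)

print("normative r(A, B):", round(r_normative, 2))
print("ipsative  r(A, B):", round(r_ipsative, 2))
```

In this invented sample the normative correlation between Tests A and B is strongly positive, because bright Ss score high on both and dull Ss low on both, while the ipsative correlation is strongly negative, which is the within-person relationship that the differences in general level had concealed.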

In a previous paper (Broverman, 
1960a) it was pointed out that ipsative 
scores may be used to control for inter- 
individual differences in G level. Pre- 
sumably, a person’s general level of 
performance on a large battery of 
tests will vary as a function of his 
general intellectual endowment. Hence, 
deviations of particular tasks above or 
below the person’s general or mean 
level of performance may be con- 
sidered ability variations independent 
of his G level. The use of ipsative 
scores in an analysis, then, may be 
considered a form of control for inter- 
individual variations in G level. 
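
The computation of ipsative scores as deviations from the person's own mean may be sketched as follows. This is an illustration only: the function name `ipsatize` and the scores are hypothetical, and the sketch assumes that the tests are already expressed on comparable scales. Division by each person's own standard deviation, which full row standardization would require, is omitted.

```python
import numpy as np

def ipsatize(scores):
    """Express each score as a deviation from that person's own mean.

    scores: array of shape (n_subjects, n_tests); each row is one S.
    """
    scores = np.asarray(scores, dtype=float)
    return scores - scores.mean(axis=1, keepdims=True)

# Hypothetical battery of three tests for a bright S and a dull S who
# share the same profile shape (strongest on Test 1, weakest on Test 3).
raw = np.array([[130.0, 120.0, 110.0],   # bright S
                [ 90.0,  80.0,  70.0]])  # dull S

ipsative = ipsatize(raw)
print(ipsative)
```

Both Ss receive the same ipsative profile (10, 0, -10), since the difference in their general levels has been removed and only the intra-individual variation remains.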

It should be noted that extraction 
of a general group factor from raw 
scores does not control G level in the 
same manner as ipsative scores. A 
G factor is based on correlations or 
covariances common to all columns. 
Extraction of this factor leaves the 
correlations specific to certain columns 
as residuals. However, these residual 
correlations are still based on norma- 
tive scores and reflect normative group 
relationships which are independent of 
the G factor. As pointed out earlier, 
residuals cannot contain information 
not included in the original correla- 
tions. Since the status of a particular 
column score in an individual’s profile 
distribution does not enter into the 
correlation between columns, this in- 
formation can not be contained in a 
residual correlation. Ipsative scores, 
on the other hand, are based on pre- 
cisely this information. Hence, these 
two methods of dealing with G are 
not comparable. 
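
The point that residual correlations carry no profile information may be checked numerically. The sketch below rests on several assumptions: the first principal axis of the correlation matrix stands in for the general factor (the method of extraction is not specified here), and the data are simulated. Two hypothetical data sets are built with identical column correlations. The second merely rescales each test, which is used only as a convenient way of altering every S's profile while leaving the correlations between columns untouched.

```python
import numpy as np

rng = np.random.default_rng(1)

def residual_after_general_factor(data):
    """Column correlations minus the part due to the first principal axis."""
    r = np.corrcoef(data, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(r)              # ascending eigenvalues
    loadings = eigvecs[:, -1] * np.sqrt(eigvals[-1])  # largest axis as "G"
    return r - np.outer(loadings, loadings)

def ipsative_correlations(data):
    """Column correlations of scores centred on each person's own mean."""
    centred = data - data.mean(axis=1, keepdims=True)
    return np.corrcoef(centred, rowvar=False)

# 200 hypothetical Ss on 5 tests, with a shared general component.
x = rng.standard_normal((200, 5)) + rng.standard_normal((200, 1))
# Same column correlations, different profiles (tests rescaled).
y = x * np.array([1.0, 2.0, 3.0, 4.0, 5.0])

same_residuals = np.allclose(residual_after_general_factor(x),
                             residual_after_general_factor(y))
same_ipsative = np.allclose(ipsative_correlations(x),
                            ipsative_correlations(y))
print(same_residuals, same_ipsative)
```

The residuals are identical for the two data sets because they are a function of the column correlations alone. The ipsative correlations differ, because they depend on where each score stands within the individual's profile.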

Control of differences in G through 
the use of ipsative scores, then,
permits inverse relationships between tests
within the individuals composing the 
group to emerge as positively and 
negatively defined normative person- 
ological factors. 

The implications of this difference 
in factor definition for theories of 
ability are considerable. Previous 
theorists have interpreted the occur- 
rence of normative group factors de- 
fined exclusively by positively loading 
tests as an indication that intrapsychic 
organization consists of a number of 
independent dimensions on which Ss 
differ only by being more or less pro- 
ficient. Development, or an increase 
in the abilities of a given individual, 
then, simply implied increments in 
proficiency on the various dimensions 
of psychological functioning. Hence, 
development is treated in exclusively 
positive terms from this standpoint. 

On the other hand, the occurrence 
of normative personological factors 
which are both positively and negatively
defined suggests that certain
abilities tend to be antagonistically
related within the individual. An
increase in ability in one class of
behaviors apparently inhibits or lowers
the level of ability in another
behavioral class, thus producing the
inverse relationship. Of course,
designation of one end of a doubly
defined factor as positive or negative is
arbitrary on the part of the factor
analyst. Behaviors at a given end of
a doubly defined factor represent
dominant abilities for some Ss and
subordinate abilities for others. In
other words, some Ss will be stronger
on the behaviors at one end of a factor,
other Ss stronger in the behaviors at
the opposite end. Each doubly defined
ability factor, then, may represent a
“choice point” for the growing
individual, i.e., at some point in his
development a “decision” must be made
on each bipolar factor as to whether
he is to specialize in one or the other
class of behaviors. Development 
viewed from this standpoint is a series 
of nodal points or forks in the road 
leading to specialization in one class 
of behaviors at the expense of another. 
Thus development is conceptualized in 
both positive and negative terms. It 
might be noted that this interpretation 
is similar to the notion of specializa- 
tion in development presented by 
Werner (1957). Other investigators 
(Cattell & Baggaley, 1956; Sweney, 
1959) have interpreted this specializa- 
tion or directional aspect of ipsative 
scores as an index of motive strength. 

Issues related to normative versus 
ipsative scores, then, include dif- 
ferences in results, differences in the 
type of inference possible from the 
results, and consequently, differences 
in the theoretical implications of the 
data. These differences, obviously, 
are of the greatest magnitude for re- 
search in psychology. Investigators 
employing factor analysis as a research 
tool should seriously consider whether 
the scores they employ are appropriate 
to their hypotheses, goals, or interests, 
if they are to avoid unwarranted
conclusions, and possibly unnecessary
failure.


SUMMARY 


Although long a point of controversy, 
it is now generally maintained that 
factor solutions obtained from a Q 
analysis (correlating people) are es- 
sentially transpositions of factors pro- 
duced by R analysis (correlating 
tests). The present paper takes issue 
with this position by presenting new 
arguments and empirical evidence of 
important differences between the two 
techniques, as they are commonly em- 
ployed. The usual Q analysis is shown 
to involve, implicitly, a double cen- 
tered matrix, i.e., a matrix in which 
all columns and all rows have the same
mean. In line with Burt’s mathe- 
matical demonstration (1937) such a 
matrix is shown to produce equivalent 
factors by either technique. However, 
these solutions are shown to differ 
from the usual R analysis which, typi- 
cally, employs a single centered ma- 
trix. As commonly performed, then, 
the two techniques give different fac- 
tor solutions. 
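
The equivalence for a double centered matrix, and its departure from the usual single centered R analysis, may be illustrated as follows. This is a sketch under stated assumptions rather than a reproduction of Burt's demonstration: the data are simulated, only means are equated (variances are left unstandardized), and eigenvalues of cross-product matrices stand in for a full factor solution. Because the double centered matrix has zero means in both directions, `x.T @ x` and `x @ x.T` are, up to a constant, the covariances among tests and among persons respectively.

```python
import numpy as np

rng = np.random.default_rng(2)

def double_center(scores):
    """Remove column means and row means, restoring the grand mean once."""
    scores = np.asarray(scores, dtype=float)
    return (scores
            - scores.mean(axis=0, keepdims=True)
            - scores.mean(axis=1, keepdims=True)
            + scores.mean())

def top_eigenvalues(matrix, k):
    """The k largest eigenvalues of a symmetric matrix, descending."""
    return np.sort(np.linalg.eigvalsh(matrix))[::-1][:k]

# 8 hypothetical Ss on 5 tests, with differences in general level.
raw = rng.standard_normal((8, 5)) + 3.0 * rng.standard_normal((8, 1))
x = double_center(raw)

# All rows and all columns now have the same mean (zero).
means_equal = (np.allclose(x.mean(axis=0), 0.0)
               and np.allclose(x.mean(axis=1), 0.0))

k = 4  # rank of a double centered 8 x 5 matrix is at most 4
r_double = top_eigenvalues(x.T @ x, k)   # correlating tests
q_double = top_eigenvalues(x @ x.T, k)   # correlating persons
q_equals_r = np.allclose(r_double, q_double)

# The usual R analysis centres the columns only.
single = raw - raw.mean(axis=0, keepdims=True)
r_single = top_eigenvalues(single.T @ single, k)
single_differs = not np.allclose(r_single, r_double)

print(means_equal, q_equals_r, single_differs)
```

In this sketch the two directions of analysis agree once the matrix is double centered, while the column centered matrix retains the variance due to general level and therefore gives a different solution.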

The above, however, suggests that 
the differences in factor solutions are 
a function of single versus double cen- 
tered matrices rather than the direction 
of correlation. In turn, it is shown 
that the two matrices differ by employ- 
ing different types of scores. Thus, 
the single-centered matrices custom- 
arily employed in R technique usually 
employ either raw data or standard- 
ized columns of test scores. In either 
case these scores, when correlated, are 
“normative” in that they reflect inter- 
individual differences in terms of group 
norms. Similarly, relationships ob- 
tained from these scores are expressed 
with respect to group norms. Factors 
derived from normative scores, then,
characterize consistencies within the 
performances of a group. Accord- 
ingly, it is proposed that such factors 
be termed “normative group” factors. 

Double centered matrices, on the 
other hand, involve standardization of 
scores along rows as well as columns. 
Standardizing the rows of scores for 
each person produces “ipsative” scores, 
or scores which are scaled about intra- 
individual norms. Ipsative scores re- 
flect intra-individual variations in abil- 
ity. Accordingly, factors derived from 
ipsative scores represent patterns of 
intrapsychic ability organization com- 
mon to the Ss composing the group. 
It is proposed, therefore, that such 
factors be termed “normative person- 
ological” factors. 

It is suggested that normative group 
factors are appropriate for inquiries
aimed at establishing patterns of group
behavior. Such information would be 
useful for problems involving group 
assessment, management, or predic- 
tion. On the other hand, isolation of 
normative personological factors seems 
the appropriate goal for inquiries 
aimed at establishing parameters of 
intrapsychic organization. From this 
standpoint, the usual procedure of
making inferences concerning the
organization of intrapsychic functioning,
e.g., Thurstone’s Primary Mental
Abilities (1938), from normative group
factors is unwarranted and should lead
to different conceptions of human
functioning than would inferences
drawn from normative personological
factors.

REFERENCES

Babington-Smith, B. Factor analysis: II.
An evaluation of factor analysis from
the point of view of a psychologist. J.
Roy. Statist. Soc., Series B, 1950, 12, 73-85.

Brooks, Marjorie O., & Phillips, L. The
cognitive significance of Rorschach
developmental scores. J. Pers., 1958, 26,
268-290.

Broverman, D. M. Cognitive style and
intraindividual variation in abilities. J.
Pers., 1960, 28, 240-256. (a)

Broverman, D. M. Dimensions of cognitive
style. J. Pers., 1960, 28, 167-185. (b)

Burt, C. L. Correlations between persons. 
Brit. J. Psychol., 1937, 28, 59-96. 

Burt, C. L. The factors of the mind. 
London: London Univer. Press, 1946. 

Cattell, R. B. Psychological measurement:
Ipsative, normative, and interactive.
Psychol. Rev., 1944, 51, 292-303.

Cattell, R. B. Factor analysis. New
York: Harper, 1952. (a)

Cattell, R. B. The three basic factor
analytic research designs: Their
intercorrelations and derivatives. Psychol.
Bull., 1952, 49, 499-520. (b)

Cattell, R. B., & Baggaley, A. R. The
objective measurement of attitude
motivation: Development and evaluation of
principles and devices. J. Pers., 1956,
24, 401-423.

Eysenck, H. J. The structure of human
personality. London: Methuen, 1953.


Podell, J. E., & Phillips, L. A
developmental analysis of cognition as observed
in dimensions of Rorschach and objective
test performance. J. Pers., 1959, 27,
439-463.

Stephenson, W. Introduction to inverted
factor analysis with some applications to
studies in orexis. J. educ. Psychol., 1936,
27, 353-367.

Stephenson, W. Some observations of Q
technique. Psychol. Bull., 1952, 49,
483-498.

Stephenson, W. The study of behavior.
Chicago: Chicago Univer. Press, 1953.

Sweney, A. B. The objective measurement
of motivation in children. Amer.
Psychologist, 1959, 14, 355. (Abstract)

Thomson, G. H. The factorial analysis of
human ability. London: London Univer.
Press, 1939.

Thurstone, L. L. Primary mental abilities.
Chicago: Chicago Univer. Press,
1938.

Tryon, R. C. Cumulative communality
cluster analysis. Educ. psychol. Measmt.,
1958, 18, 3-35.

Werner, H. The concept of development
from a comparative and organismic point
of view. In D. B. Harris (Ed.), The
concept of development. Minneapolis:
Univer. Minnesota Press, 1957.

(Received November 27, 1959)



















